Motif T

TRT can WL hh hh pFTY TBp p YRKWLhl K 

8p Trtlp 4 29 WLYNSFIIPILOSrrriTBSSOUWRTVTFlUtDIWKLUrRPPrTSMKM fi 

hTRT 54 6 wiiMSvrwELmsrrrv Tft 1 r f qkwrlffykjlsvwsku)sigxrqhlk lO 

Xa_PI23 441 WIFEDLWSLIRClTTVTKOOKSVSKTYTYKKNIWDVIKKMSIADUaK 8 
8e_BsC3p 3 66 WI^FRQLIPKI I C T I f g « LTA ISSTVT- IVTyitHDTWWKI.rTPPrVEYFK 8 
KEVEEWKKSLCFAPCKUUaTXXTT- 0

CRNHNSYTL^NFKHSKKRIXFIOKSNN I 

p hh h R 
LST^LCTCKFRFKPKRIVNXFKPKGG 0 
;:iILRICYYPOAWKHAOVXMXLICPGKS 6 
ECKISKICPENPYNTPVFAXRKXDST 1 
Promoter Reporter Construct
Seq. ID. No 7



GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC 
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCAGACGCACGGGACCCTGCG 



NFkB_CS1
GGGRQTYYQC
NFkB-MHC-I.2
TGGGCTTCCCC 



ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intron 1

GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CS1
GGGRQTYYQC 
NFkB_CS2 
RGGGRMTYYCC 

Topo_II_cleavage_site
RNYT'JrJCrTNG^/NGKTrrYNY 



AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 
! A.\.\ACCCC.\A AACCCC-^^-AA CCCCTTTTAG AGCCCTGCAG TTCGAA.ATAT
i i AACCTCAG I A TTAATAAGCT CaGATTTTAA ATATT.AATTA C.AAAACCTA-A 
! 0 1 ATGGAGGTTG j\TG7TGATA^ TCAAGCTGAT AATCATGGCA TTCACTCAGC 
; 3 1 I CTTAaGACT TGTCAAGA.A,-\ TT.A.AAGAAGC TAAA.ACGTTG TACTCTTGGA 
20 1 TCCAGAAAGT TATTAGATGA AG AAATCAAT CTCAA-AGTCA T7ATA.AAGAT 
25 I TTAGA-^GATA TT A.AA.ATATT TGGGCAGaCA A.ATATTGTTG CTACTCCACG 
:>01 AGACiATA,aT GA-AGAAGATT TTAAaGTTAT TGC.AAGAAAA GAAGTATTTT 
3 5 1 C.A.ACTGGACT AATGATCGA-A CTTATTCACA AATGCTTAGT TGAACTTCTT 
JO 1 1 CATCA.AGCG ATGT7TCAGA TAGaCAAAAA CTTCAATGAT TTGGATTTCA 
J5I ACTTA.AGGGA AATCA.ATTAG CAAAGACCCA TTTATTAACA GCTCTTTCA.A 
iO 1 CTCAAAAGCA GTATTTCTTT CAAGaCGAAT GGAACCAAGT TAGAGCAATG 
5 5 I aTTGGAAA TG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
60 1 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTCCGGG.AAT AACGTTTTTG 
651 ATCAT7TGA.A AGTCAaCGAT AAGTTTCACA AAAAGC.AAA.A AGGTCGAGCA 
"0 1 GCAGACaTGA aTGAACCTCG ATGTTGATCA ACCTGCAA.AT ACA.-\TGTC.AA 
"1 GA.ATGAGAA.A.GATCACTTTC TCA.aCaaCAT CAACGTGCCG AATTCGAATA 
501 aTaTG.AAATC A.-\G.A.ACCAGA aTATTTTATT GCACTCATTTTAATAGA.A.AT 
i 5 I .^\CCAATTCT TC.AAAA.AGCA TGAGTTTGTG AGTAACAAA_A ACA-ATATTTC 
rOI AGCGATGGAC AGAGCTCAGA CGATATTCAC GA.ATATATTC AGATTTAATA 
9> I GAATTAGAAA GAAGCT.A.AA.A CATA.-i.GGTTA TCGAAA.A.A.AT TGCCTaCATG 
i 00 1 CTTGAGAAAG TCAA.AGATTT TAACTTCAAC TACTATTTAA C.AAAATCTTG 
1051 TCCTCTTCCA GAA.AATTGGC GGGA-ACCGAA ACAAAAAATC GAAA^CTTGA 
! 1 0 1 TAA.ATA.A.AAC TAG AG.-i_AG.A.A .A.AGTCCA ACT ACTATGAAGA GCTGTTTAGC 
i 1 5 I TACACAACTG ATAATA.AATG CCTCaCACAA TTTATTAATG AATTTTTCTA 
I 20 i C.AATATACTC CCCA A.AGACT TTTTC ACTGG AAG AAACCGT A.AGA ATTTTC 
! 25 I A.AAAGAA.'^GT T.AAGAA-ATAT GTGCAaCTAA ACAAGCATGA ACTCATTCAC 
; jO I AAA-AACTTaT TGCTTGAGAA GATCAaTaCA AGAGAA.ATAT CATGCaTCCA 
1331 GGTTGaGACC TCTGCAAAGC ATTTTTaTTA TTTTGATCAC GAAAACaTCT 
: =:0 I aCGTCTTaTG GaaaTTGCTG CGaTCGaTAT TCGaGGATCT cgtcctctcg 
: -: 5 I CTGaTTAGaT GaTTTTTCTA TGTGaCCGaG CAACAGAAAA GTTACTCCAA 
. 50 I .•^aCCTaTTaC TaCaGAAAGA aTaTTTCGGA CCTCATTATC AAAATGTCAA 
: 5 5 1 TCCCAGACTT /1_aaGaaGG A A aCGCTTCCTC aGGTCCAAGA AAAAG aGGTT 
; 60 1 GaaGAATGGA AAAAGTCCCT TGG ATTTGCA CCTCGAAAAC TCaGACTAAT 
; 65 I aCCGAAGAAA ACTACTTTCC gtccaattat GACTTTCAAT aagaagattg 
J ^P-' _I'NAATTCAGA_CCGGAACACT ACAAAATTAA CTAC.AAATAC GaaGTTaTTG 
: 7 5 1 AACTCTCACT TAAfGCTtAA GACATtC AAG" A ATAGAaTCTTTaaaGaTCC " " 
i30l TTTTGCATTC GCTCTTTTTA aCTATGaTGa tctaatgaaa aactatcagg 
1351 aGTTTGTTTG CAAATGGAAG CAAGTTGCaC AACCAAAACT CTTCTTTGCA 
1901 aCTATCGATA TCGAAAAGTC aTaTCaTAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA a aTTACTTTC TTCAGATTTC tggattatga 
2001 CTGCACAAATTCTAAAGAGA AAGAATAACA TAGTTATCCA TTCGAAAAAC 
:05 I TTTAGAAAGA AAGAAaTGAA aGATTa 1 t 1 I AGACAGAAAT TCCAGAAGAT 
; !0 I TGCACTTGAA GGaGGaCAAT aTCCaaCCTT aTTCAGTGTT CTTGaaaa TG 
2151 aaCAAAATGA CTTaaaTGCa aaGaa aaCaT TaaTTGTTGA aGCAAA.GCAA 
20 I AGAAATTATT TTAAGAAAGA TaaCTTaCTT CAACCAGTCA TTaaTaTTTG 
25 1 CCaaTATAaT TaCaTTaaCT TTaaTCGGaa GTTTTaTAAa CA.aaC.---aaG 
30 I GaaTTCCTCA aCGTCTTTGa CTTTCaTCAA I I I I CTCaTC a I 1 I i ATT AT 
AGGA.AAGCTC CTT AGGATTC CTTAGAGATG aaTC^^tGAA
SE^^--j^-^TCC.\AATC—A aTCTTCTAATGAGACTTACA GATGACTATC 
I TTTTCA . , AC AACTCAAGAG AATAATCCAG TATTCTTTAT TGAGAA.ACTT 
pTAAACCiiAA CTCC.GAAAA TGGATTTAAA TTCA-ATATGA AGAAACTACA 
™±:P' ' ' ' CCATTAAGTC CAAGCAA.ArrTGC.A.AA.ATAC CGAATCGATA 
GTG . ioaGGa GCAAAATATT GTTCAAGATT ACTGCGATTG GATTCGCATC 
iCAATTGaTa TGAAA.ACTCTTGCTTTAATG CCA.-^ATATTA aCTTGAGAAT 
AGAAGGaaTT CTCTGTaCAC TCAATCTAAA CATGCAA.ACA A.AGAAAGCAT 
CAATG i^GCi CaaGAAGAAA CTAAAGTCGTTTTTAATGAA TAACATTACC 
^S51 r^I^'^^^ GAAAGACGAT TACAACCGAA GAC-rTTCCGA ATAAAACTCT 

'-III ^'^^•^^^^■•^ tttatatcag gcgg-ttacaa ATACATGCAA tgagccaaag 

^7r ''r-'-''- ^^-^^^'^C AaGAACTTAG CTATGAGCAG TATCAtJ^? 

- 0 Ar— ^"'-^f^'^TATACTCTCTAACCAGAGCATTCT-n-AAATA 
-•001 CC mCio iuC aaTATTAaGG aTaCA-ATTTT tggagaggag cattatccag 
-0,' V^IIr7-£L"^-GCACACTG AAGCACriTA TTGAA.ATATT CaGCac!JC. A 

: ; ° ' rr-Jf'^- ' l~- '5-^cagagt TTGCATGATC CTCAAGGC.-^.A ^aga.agca1a 

:-n ; ' ^-CCAATGTC AaTCTCTAaT TCAATATGaT GCATAGTCGA 

ri;^^rL^J±:E ^^t^gga AAGTTA.Arrr tca-atttttg TCTTArAiAr 

^--Goo 1 1 1 I u GGGTTTTGGG GTTTTGGGG 



! :V(EVDVDNQAD NHGiHSALKT CEE!K£AKTL YSWIQKVIRC RNQSQSF-'-CD
i! LEDUOFAQT Nl VATPRID^i^ EEDF:<VIaRX EVFSTGLMIE LIDKCLVELL 
: 0 1 SSSDVSDRQK LQCr Gr QLKG NOLAKTHLLT ALSTQKQYFF QDEWNQVRam 
;51 IGNELFRHLY TrCYLIFQRTS EGTuVpFCGN NVFDHLKVND KFDKKQKGGA 
:0I ADNCwEPRCCS TCKYNVKNEK DHFLN-NINVp NV/NNMKSRTR IFYCrnFJv/RN 
151 NQFr.<KHErV SNKJsfNTSAMD R,AQTIFTNFF RFNRIRKKLK DKVIEKJaYM 
301 LEKVTODFNFN YYLTXSCPLP ENV/R£RKQKJ ENLINKTR££ KSKYYEELFS 
3 51 Y7TDNKCVTQ FINEFF'^TML PKDFLTGRNR K>TFQKKVKKY VELNKHELH 
-0! KNLLLEKJNT REISWMQVET SAiCKFYYFDH ENrAT-WKLL RWTFEDLVVS 
-:51 LIRCFTfVTE QQKSYSKTYY YRKNTWDVTM KMSIADLKKE TLAEVQEKEV 
501 EEvVfOCSLGFA PGKLPilPKK TTFRPIMTFN KXJVNSDRKT TKLTTNTKLL 
55 1 NSHL.\1LKTLK NRiMFI-CDPFGF A VFNYDDVMK KYEErVCKWK QVCOPKLF-A 
501 TMDIK<CYDS V>/R£:<I^TPL KTTKLLSSDF \V7MTAQILKJl KN>nVIDS-^^ 
:^51 FRKKEMKDYF RQKJQKJALE CGQYPTLFSV LENEQKDLNA KJCTLrVEAKQ 
"01 RNY7:<:<I)NLL 0?vrNICQ'f>J YINFNGrCFYK QTKGIPQGLC YSSILSSF-rf ' 
"I'i aTLEESSLGF LP.DES.VfNPEN PNVNLLMRLT DDYLLITTQE NNAVLF'EKI. 
SOi f^A'SRENGFK FN.VfKKLQTSF PLSPSrCJAKY GMDSVEEQNI VQDYCDSVlCf 
S5I SIDN1K TL.AL.M PNfNLRIEGI LCTLNLNMQT KKASM\VL:<:<:< LKSFLNtNNlT 
fO! rfr ."RKTiTTE DFA^^Cu.^^ FISGGYKYMQ CAKE'r^KFK iO.'LAMSSNUD 
o: LEVSKJIYSV TR.AFFKYLVC NMJCDTIFGEE HYPDFFLSTL KIHFIEFSTK 

CO! :-:ytf?nrvcmi l:<.ak^akl:-:s docqsliqyd a 
1 ggcacc:?2.c^r2.ccrr=rzzr=zrc3caagcc2aLzgcr-cccc^aacpc^cccaaaccrc^^gaaacaLrizricaaca 80
5 1 accciacaacaacaccaagccaaaccrcaacacgaaggzgcracnagcgaccgacaacaccnccacLZzaccrgccgrca 160 

151 ccaagcacaaggacaaaaagaacaacccccczrcrrccaaagaccrrracrrraccaaccracrrrzcaaacacactrcg 24 0 

241 cgrrcgccracrtrraaccgcggcaccgz-rragccgccaccrccagccaaccgzgcgncrcraccccg^caccggacac 320 

3 21 acczcrrggagcagcccacagaaaccccracaaaccrncrgacgagaccacaccagacccacracagccrgcgcacacnc 400 
tOl t -aacanggagcczracaccrragacgagccacgccgcatigacggagcaLtrggraccaccraacgctrgccrrgaaaaq 480 

4 81 gc-gacaaccacccgcaaaaccacgLccc-agcggcggcaacccgcgaaagcczrzrgacgccrgcacacgcccagcacg 5 60 

5 61 ac^gagacactcaaaaaccrccacccaccacaacccrrcraacgcggccrracrccrccactrrccaLzcrcacgccgcc 640 
541 czaaacaLgcaticaccncgcaccaggcccrrcrccgcc-racccccggaaccgcaccrrctccaccacccczccraacga 720 
7 21 acaacctaaacLagccccgcctacaactgacagcagcagaaagaccggcgaccccacccgcgcaacgctaccagccraaa 600 
301 gacaczcrgcaaaacaccratcagccaccaccacacaaaaaaaacccracaaccacaaacacraaccaacacrcgcggcc 8 80 
3 81 acnacrcactraaaacgccacgaccagcaggacacccrgcacacacacagccacgcrraacggcractcgcaaccrgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 

1 M T £ H H T ? :< S R I L R r L E M Q Y V 20 

^^^1.015 TAG CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GOG TCG CCC CCA AGC TCG 107 8 

21 r L C T L N D r V Q L V L = G S ? A S S * 40 

- L0 7 7 TAT AGC AAT ATA TCC CAA CCC TTG AGA AGC GrxT G aA C.*tA , CC • - - • — - ATT TTT CTT 1.^3 8 

!~4iYSM:crRLRSDVQTS~S:rL 60. 

1 1 3 9 C.-.T TCG ACT GTA GTC CGC TTG GAG AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 119 8 

W'5iH3TVVGrDSK?0=:GVQr3S? 80 
L?9 .:^A.A rzc ~A CAG TCA GAG ^ ca ca C3 L = r r - r r c m r z " r a ccc???a capcr^a - i c -ggcaa 127:
31 ... , . ^ S £ - 

i7 3 CTA ATA GCG AAT CTT GTA AAA CAG ATC TTC GAT CAA ACT TTT CAG CGT CGA AGC j^jlt — . .... 
37 : A N V V K O M r n - c: ~ - = - 
rAC ATA TCT ATA C* CAG TCA AAA AAT TGG CAA 1529

iagacgrrgcgcacczcgaacaagaccgacaagcacag T ATC CGC 1601 

: C 155 

'CC .AAA GGA AGT ATT TTT GAG CCT CTT CCA AAT GAC 16 61 

J :< G S : F £ A L ? N O 175 



.A C * . TTT .AAA AAT 



^ AG GAA ACT CTC 1721 



? L F :< M M V :- =: £ T v 195 

GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 

^ S : T 0 N ;< s A ?. ;< 215 

ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

lSRTSlT^RSS '{ 235 

ccgcraccrcrcacaaccaaccrzag AT CTA TAT TTT AAC 

L Y F N 245 

;CA CTA CAC ATC TCG CTT CAA TGG ATT TTT CCA AGC 19 67 

: V H M W L Q W : r p ^ 265 

:9Sa CAA TTT GGA CTT ATA AAC CCA TTT CAA GTG AAG CAA TTG CAC AAA CTC ATT CCA CTG GTA 2027 

V K Q L H :< V r p L V/ 285 



0 r G L r N A 



^-^,1^^^^ ^"^ CCC AAA CCT CTC CTA AAC GTA TAC CCT TTA ATT GAA CAA ACA 2087 

2 088 CCA AAG CGA CTC CAT CCT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

•'^ I- H R C S L S K V Y N H Y C ? Y : 325 

2148 GAC ACC CAC GAT CAT CAA AAA ATC CTT AGT TAT TCC TTA AAG CCC AAC CAC CTG TTT GCC 2207 

0 T H D 0 £ .< r L S Y S L K ? N Q V F A 345 



CCA TCC ATT CTT CTT CCA GTG TTT CCT AAA TTA ATC TCG GGT AAC CAA AGG ATA 22 57 



2208 TTT c: 

^ - s : L •/ R V F ? >: L : w c n :? r : 365 

.-53 ..T "AG ATA ATA TT.A AAA c c'a c crza caaaa c r r a z raccac caacc2 CI r "iccac AC CAA ACT 2 3 3 6 

^S6 F z : : L D " ^ , . 3-5 
TCC AGA TAC GAG T~ TTT AGT TTA CAT TAT TTA ATC AGT AAC ATA AAG 23 95

s R r E s r s L :-: r L M 3 N r :< 395 



c cas C2 cgccsaaccccrrraccscraacciacaaccag ATT TCA GAA A** GAA TCC CTA GTC CTT GGA 2C65 

IS^TH'^^LVLG 405 



2 4 6 6 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

- 406 :< ?. S N A :-c M C L s D r £ :< :< Q I r A 425 

2S2 5 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2SBS 

426 e'fIYWLYNSFI I P It-QSrFY 445 

25 8 6 ATC ACT GAA TCA AGT GAT TTA CGA AAT CCA ACT GTT TAT TTT AGA AAA GAT ATT TCG AAA 264 5 

J 46 I T £ S S D L R N R T V Y F •< D I W K 46S 

:G4 6 CTC TTC TCC CGA CCC TTT ATT ACA TCA ATG AAA ATC GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

.:66L 1 C R ? F I T S M K M £ A F £ K I N £ 485 

2706 gcaCw*zaaaccacccccz?caaaaaqcc^acac wZlzag AAC AAT CTT AGG ATG GAT ACT CAG AAA ACT 277$ 

486 - NNVRMDTQKT 495 

177 6 ACT TTC CCT CCA GCA GTT ATT CGT CTA TTA CCT .AAG AAG AAT ACC TTT CGT CTC ATT ACG 233 5 

496T L P ? A V I R L L P.K '< ^^ ^ ^ ^ I T 515. 

2336 AAT TTA AGA AAA AGA TTC TTA ATA AAG gcaccaacccrr ggccaccaact^cictcric^czraacccacca 2906 

516 M L R ;< R F L 1 :< 524 

2907 zcagcag ATC GGT TCA AAC .AAA A.AA ATC TTA GTC AGT ACG AAC CAA ACT TTA CCA CCT CTC 2967 



525 



542 



2 9*6 3 CCA TCC ATA CTC AAA CAT TTA ATC AAT CAA GAA AGT AGT GGT ATT CCA TTT AAC TTC GAG 3 027 



54 3 A 
3 023 GTT TAC ATC AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CCA ATC TTT CG gcaac 3 088



5 63 V r M :< L L T 
3 155 AAA TCC TCT TAT CAT CCA ATA AAG CAA GAT TTC ATC TTT CCC ATT CTT AAA AAG AAA CTC 3215

592 :< S C Y D R I K 0 O L M F R I V K K K 'L 611 



3 216 AAG GAT CCC GAA TTT CTA ATT CGA' AAG TAT GCA ACC ATA CAT GCA ACA AGT GAG CCA GCT 3 275 
6l2:<D?£FVrRrCYAT I HAT SOR A 631 

3276 ACA AAA AAC TTT CTT ACT GAG GCG TTT TCC TAT T gcaagccraccrccccact^gaacuct^caacaa 3343 



532 T 



643 



3 3 4 4 dczcirrrtzag TT CAT ATC GTC CCT TTT CAA AAA GTC GTC CAG TTA CTT TCT ATC AAA ACA 3 405 

544 P M V ? F £ :< V/ V Q L L S M- K T 659 

3 406 rCA GAT ACT TTC TTT GTT GAT TTT GTC GAT TAT TCG ACC i^A AGT TCT TCT GAA ATT TTT 3465 

5 50 Z D T - F V D , F ■•/ 3 Y w 7" >: S 3 St I ^ 

3 466 AAA ATC CTC AJ^G GAA CAT CTC TCT CGA CAC ATT GTT AAG gczcacciaccgcr^aaccgcadcaaca 35 32 

630 >: - - K £ H L r G H : V >: 
4666 gcz?acccc=aaagcaagcacacracaggaccc = r2ccaaaqcaaaacc3accz=?icacczc-rrr-accracczgccc 4 74 5

474 6 craccccracaccrcraagaaagactgacagcggccgczgacractigcccacacgcccaccaaacgggaccggLraaaca 4 825 

^82 6 r-aaaagcaacacacgaggcraacccccccrcacc-agaacaaggaaagcggcr-cccacaacgaacaacgcccgcacca 4905 

4 90 6 acgcaaaaagacgaagacraLccrcraaacaagggggacraagcacacccgaaggaaaagacaccaacacacccagcgcc 4 98 5 

4 98 6 ccigaagaaagcaaggacaacccggaacaagczrcrgcagacgacaggccaaaccrcggcgaccgaaccrrggraaaagc 506 5 
506 5 cccaggctatccacggcggccggcc^rgccaczgagacgaaaagaaaccaaggacagcccgaacacraacagcccaccca 514 5 
514 6 acgccczacacaaggccrrgctrzzcccrgaccrcaacT^zrgcacgggcgaaaagaaacagLgccaagccaccaccggac 5225 

5 22 5 cc=gaaacagccaaact:cccrggccc = r=aaaccggaagcccaaagaacc-iCtgaagczraLgaggczr=aaaaacccc 5 305 
5 3 06 c==rgact:raaaggaggaacci:zrcaccgac:csggaaacggacagcccaccagcrczcgaggagaagcc=aacczrrcgc 53 8 5 
53 8 5 aaaaaagaaaacat:cactgggacacaLci.ccrgacg3accaga.cgcggagagcaicrccagcggaccczrgacgccaaca 54 65 
5 4 66 accrcracc-ccgaaacgcacggcccraczgrcrcrrcgaccrzczgcagcmacgcagccaagccaccaaaggtacc 5 54 4 




FIGURE 16 

Page 1
(Seq. ID. No. 1)

1 gcagcgctgc gtccrgcrgc gcacgtggga agccctggcc ccggccaccc ccgcgatgcc 
61 gcgcgccccc cgctgccgag ccgtgcgccc cctgctgcgc agccactacc gcgaggrgcc 
121 gccgctggcc acgttcgtgc ggcgcctggg gccccagggc tggcggctgg tgcagcgcgg 
181 ggacccggcg gcttrccgcg cgctggtggc ccagcgcctg gtgcgcgrgc cctgggacgc 
241 acggccgccc cccgccgccc cccccrtccg ccaggcgtcc tgcctgaagg agctggcggc 
301 ccgagtgctg cagaggctgt gcgagcgcgg cgcgaagaac gcgctggcct tcggcrtcgc
361 gctgctggac ggggcccgcg ggggcccccc cgaggccttc accaccagcg tgcgcagcta 
421 cctgcccaac acggtgaccg acgcactgcg ggggagcggg gcgtgggggc tgctgctgcg 
481 ccgcgtgggc gacgacgcgc tggttcacct gctggcacgc tgcgcgccct ttgtgctggt
541 ggctcccagc tgcgcctacc aggcgtgcgg gccgccgctg taccagcccg gcgccgccac 
601 tcaggcccgg cccccgccac acgctagcgg accccgaagg cgcccgggac gcgaacgggc 
661 ctggaaccat agcgtcaggg aggccggggr ccccctgggc ctgccagccc cgggtgcgag 
721 gaggcgcggg ggcagcgcca gccgaagtct gccgttgccc aagaggccca ggcgcggcgc 
781 tgcccctgag ccggagcgga cgcccgttgg gcaggggtcc tgggcccacc cgggcaggac 
841 gcgnggaccg agtgaccgrg grttctgtgc ggcgrcacct gccagacccg ccgaagaagc 
901 cacctcrttg gagggcgcgc cctctggcac gcgccactcc cacccatccg tgggccgcca 
961 gcaccacgcg ggccccccac ccacaccgcg gccaccacgt ccctgggaca cgccttgtcc 
1021 cccggtgnac gccgagacca agcacctccc ctacccctca ggcgacaagg agcagccgcg 
1081 gccctccrrc ctactcagct ccctgaggcc cagccrgact ggcgcccgga ggctcgcgga 
1141 gaccatcnrt crgggttcca ggcccrggac gccagggact ccccgcaggc tgccccgccc 

1201 gccccagcgc tactggcaaa tgcggccccr gtrtctggag ctgcctggga accacgcgca
1261 gtgcccccac ggggcgctcc ccaagacgca ctgcccgccg cgagccgcgg tcaccccagc 
1321 agccggtgcc tgtgcccggg agaagcccca gggcrccgtg gcggcccccg aggaggagga 

1381 cacagacccc cgccgccrgg tgcagccgct ccgccagcac agcagccccr ggcaggtgta
1441 cggctrcgrg cgggccrgcc tgcgccggct ggcgccccca ggccrctggg gctccaggca 
1501 caacgaacgc cgcttcctca ggaacaccaa gaagcncatc ccccrgggga agcatgccaa 
1561 gccctcgccg caggagcrga cgcggaagac gagcgtgcgg gacrgcgcrt ggccgcgcag 
1621 gagcccaggg gctggccgrg ttccggccgc agagcaccgt ccgcgrgagg agatcctggc 
1681 caagttcccg cactggccga tgagtgrgca cgccgrcgag ctgcrcaggc ctttccttta 
1741 tgtcacggag accacgcttc aaaagaacag gcccctttcc taccggaaga gtgcctggag 
1801 caagctgcaa agcattggaa tcagacagca crtgaagagg gtgcagctgc gggagctgtc
1861 ggaagcagag gncaggcagc atcgggaagc caggcccgcc ctgccgacgt ccagaccccg 
1921 cttcatcccc aagccrgacg ggccgcggcc gattgtgaac atggactacg tcgtgggagc 
1981 cagaacgctc cgcagagaaa agagggccga gcgcctcacc tcgagggcga aggcactgrt 
2041 cagcgtgctc aactacgagc gggcgcggcg ccccggcctc crgggcgcct ctgtgctggg 

2101 cctggacgac acccacaggg cctggcgcac cttcgtgctg cgtgtgcggg cccaggaccc
2161 gccgcctgag ctgtaccttg tcaaggtgga tgtgacgggc gcgracgaca ccatccccca
2221 ggacaggctc acggaggrca tcgccagcat catcaaaccc cagaacacgr actgcgcgcg 
2281 tcggtatgcc gtggtccaga aggccgccca tgggcacgtc cgcaaggcct tcaagagcca
2341 cgtctctacc ttgacagacc tccagccgta catgcgacag ttcgtggccc acctgcagga 

2401 gaccagcccg ctgagggatg ccgtcgtcat cgagcagagc tcctccctga atgaggccag
2461 cagcggcctc ttcgacgrcc tcctacgcrt catgrgccac cacgccgcgc gcatcagggg 
2521 caagncctac gtccagtgcc aggggatccc gcagggcrcc accccctcca cgctgctccg 

2581 cagcctgtgc tacggcgaca tggagaacaa gctgtttgcg gggattcggc gggacgggcn
2641 gcccctgcgt ttggcggacg atrtcttgct ggcgacacct cacctcaccc acgcgaaaac
2701 cttcctcagg accccggccc gaggtgtccc tgagtatggc tgcgtggtga acttgcggaa 
2761 gacagtggtg aacttcccrg cagaagacga ggcccrgggc ggcacggctt ttgttcagat
2821 gccggcccac ggcctatccc cccggrgcgg cctgctgctg gatacccgga cccnggaggc 
2881 gcagagcgac tactccagcc atgcccggac ctccaccaga gccagtctca ccctcaaccg 



2941 cggcntcaag gctgggagga 
3001 tcacagccrg tttcrggact 
3061 caagacccrc ctgcrgcagg 
3121 tcagcaagct tggaagaacc 
3181 ccgcracrcc atccrgaaag 
3241 cggcccrcrg ccctccgagg 
3301 gactcgacac cgtgtcaccc
3361 gccgagtcgg aagcrcccgg
3421 accgccccca gacctcaaga
3481 gagcagacac cagcagcccr
3541 cacacccagg cccgcaccgc 
3601 catgcccggc tgaaggcrga 
3661 gagtgrccag cacaccrgcc 
3721 gggccagcct ttccccacca 
3781 ccagactcgc cattgctcac 
3841 aggcggagac ccrgagaagg
3901 ccctgcacac aggcgaggac
3961 gaggcgccgt gggagraaaa
Page 2
acacgcgtcg caaacccctt
tgcaggcgaa cagcctccag 
cgnacaggtm tcacgcacgt 
ccacattrtt ccngcgcgtc 
ccaagaacgc agggatgtcg 
ccgngcagcg gctgrgccac 
acgtgccacn cctggggtca 
ggacgacgcn gacrgccccg 
ccatcctgga ctgatggcca 
gtcacgccgg gctctacgtc 
tgggagccrg aggcctgagc 
gcgtccggcn gaggccrgag 
gtcttcacct ccccacaggc 
ggagcccggc ttccactccc 
ccctcgcccn gcccrccrrt 
accccgggag ctctgggaat
cccgcaccrg gatgggggnc 
tactgaacac atgagttttt 



ggggccrtgc ggccgaagtg 
acggrgtgca ccaacatcta 
gcgcrgcagc tcccatttca 
acctcrgaca cggccrcccc 
ccgggggcca agggcgccgc 
caagcattcc tgcrcaagct 
cccaggacag cccagacgca 
gaggccgcag ccaacccggc 
cccgcccaca gccaggccga 
ccagggaggg aggggcggcc 
gagtgcrtgg ccgaggcccg 
cgagcgtcca gccaagggcc 
^gscgctcgg ccccacccca 
cacacaggaa tagtccaccc 
gccttccacc cccaccatcc 
rcggagtgac caaaggrgrg 
cctgcgggcc aaacnggggg 
cagcrrrgaa aaaaa 



FIGURE 17 
HUMAN TRT PROTEIN SEQUENCE 
(SEQ. NO. 2)

MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP 

AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGFA
LLDGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLLARCALFV 
LVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPA 
PGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPA
RPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYS
SGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPL
FLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQ
LLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTET
TFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFI
PKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLG
LDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYC
VRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSL

neassglfdWlrfmc^^ 

IRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEAL
GGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRR 
KLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPT
FFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVT 
YVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD 



FIGURE 18 

Clone 712562 
(SEQ ID NO. 3)

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 
TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 
GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCG 
GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 
CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 
AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 
GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT
GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 
CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 
CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 
GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 
CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 
GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 
TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA
CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 
AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 
TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 
TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC
GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 
GTGACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT
ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC
ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC
TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 
CCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC
TGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 
AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 
CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCG
AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 
CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 
GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 
TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCC
AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC
CCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC
CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 
GCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 
GGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAA
AAAAAAAAAAAAAAAA 




FIGURE 19

SEQ ID NO. 10 
MetSerValTyrValValGluLeuLeuArgSerPhePhe 

TyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerValTrp
SerLysLeuGlnSerIleGlyIleArgGlnHisLeuLysArgValGlnLeuArgGluLeu
SerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArgLeu
ArgPheIleProLysProAspGlyLeuArgProIleValAsnMetAspTyrValValGly
AlaArgThrPheArgArgGluLysArgAlaGluArgLeuThrSerArgValLysAlaLeu 
PheSerValLeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerValLeu 
GlyLeuAspAspIleHisArgAlaTrpArgThrPheValLeuArgValArgAlaGlnAsp 
ProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrIlePro
GlnAspArgLeuThrGluValIleAlaSerIleIleLysProGlnAsnThrTyrCysVal
ArgArgTyrAlaValValGlnLysAlaAlaHisGlyHisValArgLysAlaPheLysSer 
HisValLeuArgProValProGlyAspProAlaGlyLeuHisProLeuHisAlaAlaLeu 
GlnProValLeuArgArgHisGlyGluGlnAlaValCysGlyAspSerAlaGlyArgAla
AlaProAlaPheGlyGly 
SEQUENCE NO. 4 (DNA) AND SEQUENCE NO. 5 (PROTEIN)
(TRANSLATION OF A Δ182 hTRT VARIANT)



GCAGCGCTGCGTCCTGCTGCGCACGTGGGA 



pro arg ala pro arg cys arg ala 
CCG CGC GCT CCC CGC TGC CGA GCC 

20 

his tyr arg glu val leu pro leu
CAC TAC CGC GAG GTG CTG CCG CTG

gly pro gln gly trp arg leu val
GGG CCC CAG GGC TGG CGG CTG GTG 

50 

phe arg ala leu val ala gln cys
TTC CGC GCG CTG GTG GCC CAG TGC 

ala arg pro pro pro ala ala pro 
GCA CGG CCG CCC CCC GCC GCC CCC 

80

leu lys glu leu val ala arg val
CTG AAG GAG CTG GTG GCC CGA GTG 



gly ala lys asn val leu ala phe

GGC GCG AAG AAC GTG CTG GCC TTC 

110 

ala arg gly gly pro pro glu ala

GCC CGC GGG GGC CCC CCC GAG GCC 



1 

met

SCCCTGGCCCCGGCCACCCCCGCG ATG 
10 

val arg ser leu leu arg ser
GTG CGC TCC CTG CTG CGC AGC 

30 

ala thr phe val arg arg leu
GCC ACG TTC GTG CGG CGC CTG 

40 

gln arg gly asp pro ala ala

CAG CGC GGG GAC CCG GCG GCT

60 

leu val cys val pro trp asp
CTG GTG TGC GTG CCC TGG GAC 

70 

ser phe arg gln val ser cys
TCC TTC CGC CAG GTG TCC TGC 

90 

leu gln arg leu cys glu arg
CTG CAG AGG CTG TGC GAG CGC 

100 

gly phe ala leu leu asp gly
GGC TTC GCG CTG CTG GAC GGG 

120 

phe thr thr ser val arg ser
TTC ACC ACC AGC GTG CGC AGC 




tyr leu pro asn thr val thr 
TAC CTG CCC AAC ACG GTG ACC 

140 

trp gly leu leu leu arg arg 
TGG GGG CTG CTG CTG CGC CGC 



leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala tyr gln val cys gly pro



FIGURE 20 

Page 2

130 

asp ala leu arg gly ser gly ala 
GAC GCA CTG CGG GGG AGC GGG GCG 

150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gln leu gly ala ala
CCG CTG TAC CAG CTC GGC GCT GCC 



190 

thr gln ala arg pro pro pro his ala ser gly pro arg arg arg
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 



200 210 

leu gly cys glu arg ala trp asn his ser val arg glu ala gly
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 



220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC



250 

ala ala pro glu pro glu arg thr pro val gly gln gly ser trp
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 



260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 
280

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG

290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gln his his ala gly pro pro ser thr ser arg pro pro arg pro
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390
arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro

CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC

400 

leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 



430 
ala ala gly val cys
GCA GCC GGT GTC TGT 

440 

ala pro glu glu glu 
GCC CCC GAG GAG GAG 



leu arg gln his ser
CTC CGC CAG CAC AGC 

470 

ala cys leu arg arg 

GCC TGC CTG CGC CGG 

his asn glu arg arg 

CAC AAC GAA CGC CGC 

500 

leu gly lys his ala 

CTG GGG AAG CAT GCC 



met ser val arg asp
ATG AGC GTG CGG GAC 

530 

gly cys val pro ala 
GGC TGT GTT CCG GCC 



ala lys phe leu his 
GCC AAG TTC CTG CAC 

560 

leu arg ser phe phe 
CTC AGG TCT TTC TTT 



arg leu phe phe tyr 
AGG CTC TTT TTC TAC 



ala arg glu lys pro 
GCC CGG GAG AAG CCC 



asp thr asp pro arg 
GAC ACA GAC CCC CGT 

460 

ser pro trp gln val
AGC CCC TGG CAG GTG 



leu val pro pro gly 
CTG GTG CCC CCA GGC 

490 

phe leu arg asn thr
TTC CTC AGG AAC ACC 

lys leu ser leu gln
AAG CTC TCG CTG CAG 

520 

cys ala trp leu arg 
TGC GCT TGG CTG CGC 



ala glu his arg leu 
GCA GAG CAC CGT CTG 

550 

trp leu met ser val 
TGG CTG ATG AGT GTG 



tyr val thr glu thr 
TAT GTC ACG GAG ACC 

580 

arg lys ser val trp 
CGG AAG AGT GTC TGG 



gln gly ser val ala
CAG GGC TCT GTG GCG 

450 

arg leu val gln leu
CGC CTG GTG CAG CTG 



tyr gly phe val arg
TAC GGC TTC GTG CGG 

480 

leu trp gly ser arg 
CTC TGG GGC TCC AGG 



lys lys phe ile ser 
AAG AAG TTC ATC TCC 

510 

glu leu thr trp lys 
GAG CTG ACG TGG AAG 



arg ser pro gly val 
AGG AGC CCA GGG GTT 

540 

arg glu glu ile leu 
CGT GAG GAG ATC CTG 



tyr val val glu leu 
TAC GTC GTC GAG CTG

570 

thr phe gln lys asn
ACG TTT CAA AAG AAC 



ser lys leu gln ser
AGC AAG TTG CAA AGC 



V 
590 600
ile gly ile arg gln his leu lys arg val gln leu arg glu leu
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG

610 

ser glu ala glu val arg gln his arg glu ala arg pro ala leu
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC

650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gln asp pro pro pro glu leu

ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 



730 

gln asp arg leu thr glu val ile ala ser ile ile lys pro gln
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 
740 750
asn thr tyr cys val arg arg tyr ala val val gln lys ala ala
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val leu arg pro 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

770 780 
val pro gly asp pro ala gly leu his pro leu his ala ala leu 
GTG CCA GGG GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 

790 

gln pro val leu arg arg his gly glu gln ala val cys gly asp
CAG CCT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807
ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 
GACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 
GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 
CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 
GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 
CATCAGAGCCAGTCTCACCTTC. : ■ • :^AACATGCGTCGCAA 

_ACTCTTTGG_GGT^i- ^ k_.;.J.:^^..^^^.;,^iV^iX^ 
CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCA 
CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCT 
GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 
GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 
GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 
GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC
TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 
ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 
CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 
CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 
GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 
CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 
CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 
CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 
TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 
GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 
AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 






FIGURE 21 

Genomic DNA insert of pGRN144 
Seq. ID. No. 6 

1 CCATGGGACCCACTGCAGGGGCAGCTGGGAGGCTGCAGGCTTCAGGTCCCAGTGGGGTTG 
GGTACCCTGGGTGACGTCCCCGTCGACCCTCCGACGTCCGAAGTCCAGGGTCAGCCCAAC 

61 CCATCTGCCAGTAGAAACCTGATGTAGAATCAGGGCGCGAGTGTGGACACTGTCCTGAAT 
GGTAGACGGTCATCTTTGGACTACATCTTAGTCCCGCGCTCACACCTGTGACAGGACTTA 

121 CTCAATGTCTCAGTGTGTGCTGAAACATGTAGAAATTAAAGTCCATCCCTCCTACTCTAC 
GAGTTACAGAGTCACACACGACTTTGTACATCTTTAATTTCAGGTAGGGAGGATGAGATG 

181 TGGGATTGAGCCCCTTCCCTATCCCCCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTG 
ACCCTAACTCGGGGAAGGGATAGGGGGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACAC 

241 GAGGAAGGAATGATACTTTGTTATTTTTCACTGCTGGTACTGAATCCACTGTTTCATTTG 
CTCCTTCCTTACTATGAAACAATAAAAAGTGACGACCATGACTTAGGTGACAAAGTAAAC 



301 TTGGTTTGTTTGTTTTGTTTTGAGAGGCGGTTTCACTCTTGTTGCTCAGGCTGGAGGGAG 
AACCAAACAAACAAAACAAAACTCTCCGCCAAAGTGAGAACAACGAGTCCGACCTCCCTC 



361 TGCAATGGCGCGATCTTGGCTTACTGCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCT 
ACGTTACCGCGCTAGAACCGAATGACGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGA 

alu 



421 GCTTCCGCCTCCCATTTGGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTT 
CGAAGGCGGAGGGTAAACCGACCCTAATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAA 



481 TGTATTTTTAGTAGAGACGGGGGTGGGGGTGGGGTTCACCATGTTGGCCAGGCTGGTCTC 
ACATAAAAATCATCTCTGCCCCCACCCCCACCCCAAGTGGTACAACCGGTCCGACCAGAG 



CAP 

==================> 



541 GAACTTCTGACCTCAGATGATCCACCTGCCTCTGCCTCCTAAAGTGCTGGGATTACAGGT 
CTTGAAGACTGGAGTCTACTAGGTGGACGGAGACGGAGGATTTCACGACCCTAATGTCCA 



601 GTGAGCCACCATGCCCAGCTCAGAATTTACTCTGTTTAGAAACATCTGGGTCTGAGGTAG 
CACTCGGTGGTACGGGTCGAGTCTTAAATGAGACAAATCTTTGTAGACCCAGACTCCATC 



661 GAAGCTCACCCCACTCAAGTGTTGTGGTGTTTTAAGCCAATGATAGAATTTTTTTATTGT 
CTTCGAGTGGGGTGAGTTCACAACACCACAAAATTCGGTTACTATCTTAAAAAAATAACA 

721 TGTTAGAACACTCTTGATGTTTTACACTGTGATGACTAAGACATCATCAGCTTTTCAAAG 
ACAATCTTGTGAGAACTACAAAATGTGACACTACTGATTCTGTAGTAGTCGAAAAGTTTC 

CAP 

781 ACACACTAACTGCACCCATAATACTGGGGTGTCTTCTGGGTATCAGCGATCTTCATTGAA 
TGTGTGATTGACGTGGGTATTATGACCCCACAGAAGACCCATAGTCGCTAGAAGTAACTT 

CAP 

841 TGCCGGGAGGCGTTTCCTCGCCATGCACATGGTGTTAATTACTCCAGCATAATCTTCTGC 
ACGGCCCTCCGCAAAGGAGCGGTACGTGTACCACAATTAATGAGGTCGTATTAGAAGACG 



901 TTCCATTTCTTCTCTTCCCTCTTTTAAAATTGTGTTTTCTATGTTGGCTTCTCTGCAGAG 
AAGGTAAAGAAGAGAAGGGAGAAAATTTTAACACAAAAGATACAACCGAAGAGACGTCTC 

CAP 



961 AACCAGTGTAAGCTACAACTTAACTTTTGTTGGAACAAATTTTCCAAACCGCCCCTTTGC 
TTGGTCACATTCGATGTTGAATTGAAAACAACCTTGTTTAAAAGGTTTGGCGGGGAAACG 

1021 CCTAGTGGCAGAGACAATTCACAAACACAGCCCTTTAAAAAGGCTTAGGGATCACTAAGG 
GGATCACCGTCTCTGTTAAGTGTTTGTGTCGGGAAATTTTTCCGAATCCCTAGTGATTCC 

1081 GGATTTCTAGAAGAGCGACCCGTAATCCTTAAGTATTTACAAGACGAGGCTAACCTCCAG 
CCTAAAGATCTTCTCGCTGGGCATTAGGAATTCATAAATGTTCTGCTCCGATTGGAGGTC 

1141 CGAGCGTGACAGCCCAGGGAGGGTGCGAGGCCTGTTCAAATGCTAAGCTTCCATAAATAA 
GCTCGCACTGTCGGGTCCCTCCCACGCTCCGGACAAGTTTACGATTCGAAGGTATTTATT 

1201 AGCAAATTTCCTCCGGCAGTTTCTGGAAAGTAGGAAAGGTTAACATTTAAGGTTGCGTTT 
TCGTTTAAAGGAGGCCGTCAAAGACCTTTCATCCTTTCCAATTGTAAATTCCAACGCAAA 



1261 GTTAGCATTTCAGTGTTTGCCGACCTCAGCTAACAGCATCCCTGCAAGGCCTCGGGAGAC 
CAATCGTAAAGTCACAAACGGCTGGAGTCGATTGTCGTAGGGACGTTCCGGAGCCCTCTG 

1321 CCAGAAGTTTCTCGCCCCTTAGATCCAAACTTGAGCAACCCGGAGTCTGGATTCCTGGGA 
GGTCTTCAAAGAGCGGGGAATCTAGGTTTGAACTCGTTGGGCCTCAGACCTAAGGACCCT 

TopoII 
*****************> 

1381 AGTCCTCAGCTGTCCTGCGGTTGTGCCGGGGCCCCAGGTCTGGAGGGGACCAGTGGCCGT 
TCAGGAGTCGACAGGACGCCAACACGGCCCCGGGGTCCAGACCTCCCCTGGTCACCGGCA 



1441 GTGGCTTCTACTGCTGGGCTGGAAGTCGGGCCTCCTAGCTCTGCAGTCCGAGGCTTGGAG 
CACCGAAGATGACGACCCGACCTTCAGCCCGGAGGATCGAGACGTCAGGCTCCGAACCTC 

1501 CCAGGTGCCTGGACCCCGAGGCTGCCCTCCACCCTGTGCGGGCGGGATGTGACCAGATGT 
GGTCCACGGACCTGGGGCTCGGACGGGAGGTGGGACACGCCCGCCCTACACTGGTCTACA 

1561 TGGCCTCATCTGCCAGACAGAGTGCCGGGGCCCAGGGTCAAGGCCGTTGTGGCTGGTGTG 
ACCGGAGTAGACGGTCTGTCTCACGGCCCCGGGTCCCAGTTCCGGCAACACCGACCACAC 

1621 AGGCGCCCGGTGCGCGGCCAGCAGGAGCGCCTGGCTCCATTTCCCACCCTTTCTCGACGG 
TCCGCGGGCCACGCGCCGGTCGTCCTCGCGGACCGAGGTAAAGGGTGGGAAAGAGCTGCC 

1681 GACCGCCCCGGTGGGTGATTAACAGATATTGGGGTGGTTTGCTCATGGTGGGGACCCCTT 
CTGGCGGGGCCACCCACTAATTGTCTATAACCCCACCAAACGAGTACCACCCCTGGGGAA 

1741 CGCCGCCTGAGAACCTGCAAAGAGAAATGACGGGCCTGTGTCAAGGAGCCCAAGTCGCGG 
GCGGCGGACTCTTGGACGTTTCTCTTTACTGCCCGGACACAGTTCCTCGGGTTCAGCGCC 

1801 GGAAGTGTTGCAGGGAGGCACTCCGGGAGGTCCCGCGTGCCCGTCCAGGGAGCAATGCGT 
CCTTCACAACGTCCCTCCGTGAGGCCCTCCAGGGCGCACGGGCAGGTCCCTCGTTACGCA 

1861 CCTCGGGTTCGTCCCCAGCCGCGTCTACGCGCCTCCGTCCTCCCCTTCACGTCCGGCATT 
GGAGCCCAAGCAGGGGTCGGCGCAGATGCGCGGAGGCAGGAGGGGAAGTGCAGGCCGTAA 

1921 CGTGGTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGA 
GCACCACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCT 

1981 TCAGGCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGC 
AGTCCGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCG 

2041 CCCTCCCTCGGGTTACCCCACAGCCTAGGCCGATTCGACCTCTCTCCGCTGGGGCCCTCG 
GGGAGGGAGCCCAATGGGGTGTCGGATCCGGCTAAGCTGGAGAGAGGCGACCCCGGGAGC 

Sp1 



2101 CTGGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCAGAC 
GACCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTCTG 

2161 CCCCGGGTCCGCCCGGAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCG 
GGGGCCCAGGCGGGCCTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGC 

2221 CGGGC[illegible]ACG 

GCCCGTTGTCTGCGGGTC[illegible]CGCGAAGGGTG 

Sp1 

E2F 



2281 CCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCCCGTCCC 
GGCCAGGACGGGGAAGTGGAAGGTCGAGGCGGAGCAGGCGCGCCTTGGGGCGGGGCAGGG 

2341 GAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGGCCATCCCAGCCCGTCCCGTTCCT 
CTTGGGAAGGGCCCAGGGGCCGGGTCGGGGAAGGCCCGGTAGGGTCGGGCAGGGCAAGGA 





FIGURE 21 (continued, 4 of 6) 

Sp1 



E2F NFkB 
********* ********* 

2401 TTTCCGCGGCCCCGCCCTCTCCTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGC 
AAAGGCGCCGGGGCGGGAGAGGAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACG 

hTRT 5' 

*****************************> 

2461 GCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAG 
CGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTC 

2521 CCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGC 
GGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACG 

E2F 

2581 GGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCG 
CCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGC 



2641 CGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCC 
GCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGG 

NFkB 



2701 CCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCG 
GGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGC 



Topo_II_cleavage_site 

NFkB 
+++++++++++ 
NFkB 

Intron1 

2761 GGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGT 
CCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCA 



2821 GTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAA 
CAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTT 

2881 GAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGC 
CTTGCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCG 



2941 CTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAG 
GAAGTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTC 




FIGURE 21 (continued, 5 of 6) 

3001 CGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGC 
GCCCCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCG 

3061 ACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCC 
TGCGACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGG 

3121 GCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCG 
CGACATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGC 

3181 AAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCT 
TTCCGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGA 

3241 GGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTT 
CCCGGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAA 

3301 GCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGG 
CGGGTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCC 

3361 GTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTC 
CAGGACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAG 

3421 ACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCA 
TGGACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGT 

3481 CTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACC 
GAGGGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGG 

3541 ACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTC 
TGCAGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAG 

3601 CTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCT 
GAGTCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGA 

3661 GACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGG 
CTGACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCC 

3721 GACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCT 
CTGAGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGA 

3781 GGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCC 
CCTCGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGG 

3841 GCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTC 
CGACGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAG 

3901 TGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCA 
ACACCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGT 

3961 GCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCC 
CGTGTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGG 

4021 CCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTT 
GGGTCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAA 




FIGURE 21 (continued, 6 of 6) 

4081 CATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGT 
GTAGAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCA 



4141 GCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCC 
CGCCCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGG 

Intron2 

************************************************************* 

4201 AGGCCCCAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGCCCTGGTCCT 
TCCGGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGA 



4261 CCTGTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACG 
GGACAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGC 



4321 GTGATCGAGGTCGAC 
CACTAGCTCCAGCTG 
FIGURE 23 

EST AA281296 
(Seq. ID. No. 8) 

gc 

caagttcctg cactggctga tgagtgtgta cgtcgtcgag ctgctcaggt ctttctttta 
tgtcacggag accacgtttc aaaagaacag gctctttttc taccggaaga gtgtctggag 
caagttgcaa agcattggaa tcagacagca cttgaagagg gtgcagctgc gggacgrgnc 
ggaagcagag gtcaggcagc atcgggaagc caggcccgcc ctgctgacgt ccagactccg 
cttcatcccc aagcctgacg ggctgcggcc gattgtgaac atggactacg tcgcgggagc 
cagaacgttc cgcagagaaa agagggccga gcgtctcacc tcgagggtga aggcactgtt 
cagcgtgctc aactacgagc gggcgcg 



FIGURE 24 
(Seq. ID. No. 9) 



TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG 

ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGC 

CAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCAT 
CAGGGGCAAGTC 




Approximate Cell No. 



5,000 5,000 5,000 5,000 
NXT Heparin Affinity Glyc. grad. 
Figure 34 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 

51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 

101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC aagtatagca 

151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 

201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 

251 ctcggatgca AATCTTTATA aCGATTCTTT CTTGAGAAAA TTAGTTTTAA 

301 AAAGCGGAGA GCAAAGAGTA gaaattgaaa cattactaat gtttaaataa 

351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 

401 ttatggagaa AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 

451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 

501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 

551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA cttattgcat TTACTATTCG 

601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA ctcccgagtc 

651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 

701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 

751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 

801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 

851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 

901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 

951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 

1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 

1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 

1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 

1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 

1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 

1251 tcacattcat agatcgacct TCATATATCC aatacgatga taaggaaaca 

1301 gcagtcatcc gttttaaaaa tagtgctatg aggactaaat ttttagagtc 

1351 AAGAAATGGa CCCGAAATCT TAATCAAAAA gaattgcgtc gatattgcaa 

1401 AAGAATCGaa CTCTAAATCT TTCCTTAATA agtattacca atcttgattg 

1451 attgaagaga ttgacgaggc aactgcacag aagatcatta aagaaataaa 

1501 gtaactttta ttaattagag aataaactaa ATTACTAATA TAGAGATCAG 

1551 CGaTCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 

1601 ATaCAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA ccagttagca 

1651 aaagaaaaaa taaggcaata aataaaatga gtacagaagt gaagaaataa 

1701 AAGATTTaTT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 

1751 GGGGTTTTGG GG 






CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 
1 ^ ^ ^ ^ ^ ^ 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

a PQNPKTPKPL*KKKKLR*FR 

b PKTPKPQNPYKKRKN*GSLE- 

c PKPQNPKTPIKKEKIEVV*K- 



AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 ---------+---------+---------+---------+---------+---------+ 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 



a NKILFPHKWRWILIWMI*KI 

b IKYYSRTNGDGY*FG*YRKF- 

C *NIIPAQMEMDIDLDDIENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 
121 ^ ^ + ^ ^. 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHSTSIAALVVTRKDAK 

b TS*YIQQV*QLL*CQERMQN- 

c LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 
181 + ^ ^ ^ ^ ^ 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLARNRLH*LFQSCKNN* 

b IEIWLEIAFIDYSKVAKTIR- 

c LKSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + : + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

a SSTSRMQIFITILS*EN*F* 

b VLLLGCKSL*RFFLEKISFK- 

c FYFSDANLYNDSFLRKLVLK- 



AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

+ + + + — + 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 



a KAESKE*KLKHY*CLNKIR* 

b KRRAKSRN*NITNV*IKSGN- 

C SGEQRVEIETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 
361 + + + + ^ ^ ^ 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 



a *GLFYFLDHFLRSIMEKIT* 

b EDYSIF*ITS*GALWRKLLN- 

c RIILFFRSLLKEHYGENYLI- 

FIG. 35 



TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 
421 ---------+---------+---------+---------+---------+---------+ 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSLDYFPSQQ**VY*I 

b TKR*TVWIISLANNDEYIKF- 

c LKGKQFGLFP*PTMMSILNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 — — — — — + — — — — — 4. — _ — — c>i n 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRISIHQTYQRQTRY 

b I*E*VKGSRYIRLTKDKLAI- 

c YENESKDLDTSDLPKTNSL*- 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 
541 + (- J. ^ ^ 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKV**SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTIR- 

C NARKSLIIEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 
601 + + ^ . ^ ^ ^ 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRR*TPES*DN 

b MGFITIVLGIDGELPSLETI- 

C WVLLQLF*VSTVNSRVLRQL- 



TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 
ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 



a *KSCLQLKESQF*KF*CVCH 

b EKAVYN*RNRSSESSDVYAI- 

C KKLFTTEGIAVLKVLMCMPL- 



TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

+ + + - 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 



a YFVN*SQISYLNLMDSYRNK 

b IL*INLKYLISI*WIAIETN- 

c FCELISNILSQFNG*L*KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 
781 ---------+---------+---------+---------+---------+---------+ 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKSFGTNAH 

b QINHASLMEYTLNPLGQMHT- 

C K*TMQV*WNIR*ILWDKCTL- 



TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

+ + + 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 



a *IYIGFLKHRYTECFRD*FS 

b EFILDS*SIDTQNALETDLA- 

C NLYWILKA*IHRML*RLI*L- 
TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 
901 ---------+---------+---------+---------+---------+---------+ 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL*KKQ- 

c TTDYLF*LLLLISYIFKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 
961 ---------+---------+---------+---------+---------+---------+ 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEISKFVDSSVT 

b AK*KED*RKRFQNLLILL*P- 

C RNEKKTKERDFKIC*FFCNR- 



GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 
1021 ---------+---------+---------+---------+---------+---------+ 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS*F 

b ELTTRILATKKKKSYHNPDS- 

C N*QQEY*QRKRRRAITILIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 
1081 + + + + ^ ^ ^ 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 



a LKISKIPGKRDTFIKIHIL* 

b *RFQKFQVREIHSLKFIYYS- 

C KDFKNSR*ERYIH*NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 
1141 + + + ^ + ^ 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS*QYFLISWK- 

C FHFTAVIFFYLNNIF*LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 
1201 * ^ * ^ ^ ^ 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI- 

C KKYQIREALD*GNLAYSHS*- 



AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

• »■ — - - -f - — _ - _ — - - - -- - - . ^ - 4 - 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 



a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

C IDLHISNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 
^321 + + -»--- + ^ 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a *CYED*IFRVKKWSRNLNQK 

b SAMRTKFLESRNGAEILIKK- 

c VL*GLNF*SQEMEPKS*SKR- 
GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL*IFR**VLP 

b NCVDIAKESNSKSFVNKYYQ- 

c IASILQKNRTLNLSLISITN- 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 
1441 ---------+---------+---------+---------+---------+---------+ 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILID*RD*RGNCTEDH*RNK 

b S*LIEEIDEATAQKIIKEIK- 

c LD*LKRLTRQLHRRSLKK*S- 

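GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 
1501 ---------+---------+---------+---------+---------+---------+ 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN*ITNIEISDLQ 

b *LLLIRE*TKLLI*RSAIFN- 

c NFY*LENKLNY*YRDQRSSI- 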



TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 
AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 



a LTK*KLN*S*TIKNTNLGQN 
b *RNKS*TKVRQ*KIQTLVKI 
c DEIKAELKLDNKKYKPWSKY 



ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

■* + — + 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 



a IEEGKEDQLAKEKIRQ*IK* 

b LRKEKKTS*QKKK*GNK*NE- 

C *GRKRRPVSKRKNKAINKMS- 



GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

( + — — — — — — — + — — ^ — — — — ^. 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 



a VQK*RNKRFIFFNNLLKRGV 
b YRSEEIKDLFFSIIY*KEGF- 
c TEVKK*KIYFFQ*FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
c GVLGFW 
Figure 42 



1 aacicama atiactaan taatcaacaa gangataaa aagcagiaaa taaaacccaa 
61 tagatttaat ttagaaagia tcaangaaa aatggaaan gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaangt ggtgggaacl igaartagag aigcaagaaa accaaaaiga 
181 Utataagn agggtiaaga ngacgaicc taagcaatat cicgtgaacg icactgcagc 
241 atgUTgng taggaaggta grtactacta agataaagat gaaagaagat atatcatcac 
301 taaagcacn cngaggtgg ctgagtciga tcctgagnc atctgctagt tggcagtcta 
361 caiccgtaai gaactrtaca tcagaactac caciaaciac attgtagcat tnglgngt 
421 ccacaagaai actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacnactg gaagtcigtg aatngcata ggttctctat attmgatg caacigaan 
541 caaaaamg tatcngata ggatacmc ataagatan cgtaaggaac tcactnccg 
601 taagtgma caaagatgcg tcagaagcaa gtmcigaa ncaacgaat actaacngg 
661 taagtangc actgaatcct aacgtaagaa aacaatgnc cgtiacctct cagttaccaa 
721 caagtaaaag igggartaaa ctaagaagaa gagaaaagag aatctcttaa cxaaacrtta 
781 ggcaataaag gaatctgaag ataagiccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaaic aaggctnaa aaccagcagt tatgaagaaa atagccaaga gatagaaigc 
901 catgaagaaa cacatgaagg cacxtaaaai tcctaactct accrtggaat caaagtacn 
961 gaccrtcaag gatctcatta agttctgcca tamctgag cctaaagaaa gagtctataa 
1021 gatccnggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtganc 
1081 tgcatcigca cccncaatc ctgaanggc tggaaagcgt atgaagang aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacacigct gaggntggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ngtctaaca icttaaaagc 
1261 cggtgtnca gatactacac actctangt gatcaacaag arrtgtgagc ccaaggccgt 
1321 tgagaactcc aagatgncc ctcncaan ctttagtgcc attgaagctg ttaatgaagc 
1381 agnactaag ggancaagg ccaagaagag agaaaaiatg aatcnaaag gtcaaatcga 
1441 agcagiaaag gaagngng aaaaaaccga tgaagagaag aaagatatgg agrtggagta 
1501 aaccgaagaa ggagaamg naaagtcaa cgaaggaan ggcaagcaat acattaactc 
1561 cangaacn gcaatcaaga tagcagnaa caagaama gatgaaatca aaggacacac 
1621 tgcaaicnc ictgatgm ctggnctai gagtacctca atgicaggtg gagccaagaa 
1681 gtatggncc gncgtacn gicicgagig tgcanagic cnggmga tggtaaaaia 
1741 acgngtgaa aagtcctcat tctacatcn cagncacci agnctcaai gcaataagtg 
1801 nacnagaa gngatctcc ctggagacga aciccgicct tciatgtaaa aacrtttgca 
1861 agagaaagga aaacnggtg gtggtactga mcccctat gagtgcang atgaatggac 
1921 aaagaataaa acicacgtag acaataicgt tatmgtct gataigatga ngcagaagg 
1981 atancagat atcaatgtta gaggcagnc canguaac agcatcaaaa agiacaagga 
2041 tgaagtaaat cctaaMju_aaaicmgc agngacna gaaggnacg gaaagtgcct 
2101 taatciaggt gatgagnca atgaaaacaa'ctacatcaag aiartcggta igagcganc 
2161 aatcnaaag ncamcag ccaagcaagg aggagcaaai aiggtcgaag natcaaaaa 
2221 cmgcccn caaaaaatag gacaaaagtg agmcnga gancncta taacaaaaat 
2341 amaagna cnacatagt rtatgtatcg cagtctana gcctancaa atganctgc 
234 1 amaagna cnacatagt rtatgtatcg cagtctana gcctancaa atganctgc 
2401 aaagaacaaa aaagactaaa a 
MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKJDDPKQYL 

VNVTAACLLQEGSYYQDKX»ERRYIITKALLEVAESDPEFICQLAVYIRNELYIRTTTN 

YIVAFCWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 

QDIRKELTFRXCLQRCVRSKJ^SEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTK 

KJ<JlK£NLLTKLOAlK£SEDKSKJl£TGDINWVEDAIKALKPAVMKJ<^^ 

APKJPNSTLESKYLTFKDLIKfCFOSEPKERVYKJLGKKYPKTEEEYKAAFGDSASAP 

FNPELAGKJINQCJEISKT\VENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGV 

SDTTHSrVINKJCEPKAVENSKJv4FPLQFFSAIEAVNEAVTKGFKAKKRJEhnvlNLKGQIE 

AVKEVN^KTDEEKJODMELEQTEEGEFVKVT^EGIGKQYINSIELAlKiAVNKNLDErKG 

HTAJFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQ 

CNKCYLEVDLPGDELRPSMQKLLQEKGKXGGGTDFPYECIDEV^TTKNKTHVDNIVILSD 

N^lAEGYSDrNVRGSSIVNSIKXYKDEVNPNIiaFAVDLEGYGKCLNLGDEFNENNYT 

KJFGMSDSILhCFISAKQGGANMVEVlKNFALQKJGQK 




Figure 44 

caatactat taaoaataa" ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttganttg tattacaaaa 
tctagaagtt tacaaaagcc agattgagca tiataagacc tagtagtaat agatcaaaga 
ggaggatctc aagctmaa agncaaaaa naagattag gatggaaact ctggcaacga 
tgatgatgat gaagaaaaca acccaaataa ataataagaa ttattaagga gagt'caatta 
ganaagtag caagmaat tgataaaaaa agnggnct aaggtagaga aagamgaa 
rttgaacgaa gatgaaaaca aaaagaatgg acmctgaa tagcaagtga aagaagagta 
anaagaacg artactgaag aataggttaa gtanaaaat ttagtattta acatggacta 
ccagrtagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagartatga 
tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaar tatgtatcaa tttacgccaa 
ctaaaagaca tcatangn ggtggcttaa agattatttt aataaaaaca attatgatca 
tcrtaatgta agcanaaca gactagaaac tgaagccgaa ttctatgcct ttgatgam 
ttcacaaaca atcaaactta ctaataanc ttactagact gttaacatag acgttaattt 
tgataataai ctctgtatac tcgcangct tagamrta rtatcactag aaagancaa 
tartngaat ataagatcn cnatacaag aaattaatat aamtgaga aaattggtga 
gctacngaa actatcncg cagngtcn nctcatcgc cacttacaag gcattcam 
acaagncct tgcgaagcgt tctaatatn agnaactcc tcatcataaa ttagcgnaa 
agatagciaa nataggtai actcmctc tacagacna aaatiagttg acactaacaa 
agiccaagat lattttaagt tcrtataaga anccctcgt ttgactcatg taagciagia 
ggctatcxca gtiagtgcta ctaacgctgt agagaaccic aatgtmac ttaaaaaggt 
caagcaigct aatcrtaatt lagmctat ccctacctaa ttcaartttg atttctactt 
tgttaama caacatnga aanagagn tggattagaa ccaaaiarrt tgacaaaaca 
aaagcttgaa aatctacm tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
aaactrrtac acctacgttg crtaagaaac ctccagaaaa cagatatcaa aacaagctac 
aacaaicaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
aactccaagc gaaagcacaa gtggtatgaa atrrrngat catcmctg aartaaccga 
gcngaagat ncagcgna acngtaagc tacccaagaa amatgata gcrtgcacaa 
acrmgaa agatcaacaa amaaagaa gncaaana agttacaaat atgaaatgga 
aaagagxaaa aiggatacat tcatagaict taagaaian tatgaaacct taaacaatct 
taaaagatgc tctgttaata tatcaaatcc tcatggaaac amcrtatg aactgacaaa 
taaaganct acrrmata aamaagci gaccnaaac laagaanat aacacgctaa 
giaiacrm aagtagaacg aatmaan taataacgtt aaaagtgcaa aaartgaatc 
rtccfcana gaaagcnag aagatanga lagicmgc aaaiciang cncngiaa 
aaamacaa aaigttaata naicgccag mgccctat cccaacaata tnagaaaaa 
tcctncaat aagcccaatc nctartm caagcaam gaataattga aaaatttgga 
aaatgtatct atcaactgta ncngatca gcataiactt aatictatti cagaartcn 
agaaaagaat-aaaaaaataa.aagcartcat tttgaaaaga tattatttat tacaatatta 
tctxganat actaaartat ttaaaacact icaatagna cctgaactaa actaagtrta 
cattaanag caanagaag aangactgt gagtgaagta cataagtaag tatgggaaaa 
ccacaagcaa aaagctnct atgaaccan atgtgagtn atcaaagaai caicctaaac 
ccmagcta atagatmg accaaaacac tgtaagtgat gactctatta aaaagarm 
agaatctata ictgagtcta agiatcatca natngaga ngaacccta gttaaictag 
cagmaan aaatclgaaa acgaagaaai tiaagaactt cicaaagctt gcgacgaaaa 
aggtgrrna gtaaaagcai actataaan ccctctaigi tiaccaactg gtacnarta 
cganacaat tcagaiagat ggtgartaat taaaianag maaataaa tanaaatat 
igaatamc mgcnan amgaataa tacatacaat agicaimt agtgrmga 
aiaiatma gnarnaat tcartarrn aagtaaataa natrmca aicatrnn 
aaaaaaicg 
STDLKiVDTNKVQDYFKfLQEFPRI.THVSQQAIPVSATNAVENLhAn.LKKVKHA>fLNL 

VSIPTQFNFDFYFVNLQHLKJLEFGLEPNILTKQKLENLLLSIKQSKNLKFLRLNFYTY 

VAQETSRKQlLKQATTIKJvJLKNNK>iQEETPETKDETPSESTSGMKFFDHLSELTELED 

FSVNLQATQEIYDSLHKLLIRSTNLKJO^KI.SYKYEMEKSKJvlDTFIDLKNri'FIl.NNLK 

RCSVNISNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQ^fEFQFN^fVKSAKIE 

sssLESLEDiDSLCKSiASCK>JLQNVN'iiASi^> -r 
DSIFCKJLESISESKYHHYLRLNPSQSSSUKSENEEIQELLKACDEKCVLVKAYYKFP 

LCLPTGTYYDYN'SDRW 
MKJLFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 

PNSRKJALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKJARNEDVNNSLFCHS 

ANVNVTLLKGAA.WKMFHSLVGTYAFVDLLrNYTVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNrNSSSFFPYSKJLPSSSSIKKLTDLR 

EAIFPTNLVKJPQRLKVRrNLTLQKi.LKJlHKRJLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLFCFIIVILQKLLPQEMFGSKLKNKGKJIKNLNLLLSLPLNGYLPFDSLLKJO. 

RLKl)FR\\a.FISDIWTTKHNFENLNQLAlCFISWLFRQLIPKIIQTFFYCTEISSTVTI 

\^FRHDT\VNKLITPFIVEYFKTYLVENNVCFa^}HNSYTLSNFNHSKMRIIPKKShn«JEFR 

[IAlPCRGADEEEFTIYK£NHKNAiQPTQKJLEYLRNKJU>TSFTKIYSPTQIADRJKJE 

FKQRjLLKJ<JT^NVLPELYFMKPDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFN 

TNTGVLKXFNVVNASRVPKPYELYIDhTVRTVHLSNQDVINVVEMEIFKTALVvrVEDKCY 

IRJEDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 

nvJII<J<JLAMGGFOKYNAKANRDKJLAVSSQSDDDTVIQFCAMmFVK£LEVWKHSSTMN 

NFHIRSKSSKGIFRSLIALFNTRJSYKTIDTNLNSTNTVLMQIDHVVKMISECYKSaF 

kdlsfnvtonmqfhsflqriif,mtvsgcpitkcdplieyevrftllngfleslssnts 
kfkdniillrkeiohlqayiyiyihivn 




Figure 47 

Oxytricha LCVSYILSSFYYANLEENALQFLRJ<£SMDPEKPETNLLMRJLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
Figure 48 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGAGACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGAC-AGAAAT^ 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



Figure 49 



AKPLHWLMSV^'VVELLRSFFYVTETTFQKNRLFFYRKSVWSiCLQSIGlRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 




Figure 50 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTaTGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 

ggaagagtgtctggagcaagttgcaaagcattggaatcagacagcacttgaa 

gagggtgcagctgcgggacgtgtcggaagcagaggtcaggcagcatcggga 

agccaggcccgccctgctgacgtccagactccgcttcatccccaagcctgacg 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

cagagaaaagagggccgagcgtctcacctcgagggtgaaggcactgttcagc 
gtgctcaactacgagcgggcgcg 
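
The peptide listing and the nucleotide listing on these sheets appear to be related by a reading-frame-1 translation. The following is a minimal sketch of that check, assuming the standard genetic code; the helper name `translate` and the choice of test fragment (the first 54 nucleotides of the listing above) are illustrative and not part of the original figures.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the nuclear standard code applies to this sequence).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; unknown codons become 'X', a trailing partial codon is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 54 nucleotides of the listing above.
fragment = "GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCT"
print(translate(fragment))  # AKFLHWLMSVYVVELLRS
```

Whitespace and case are normalised before translation, so mixed-case OCR lines can be pasted in directly.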




Figure 51 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 
DSKPDEGVQFSSPKCSOSELlANVVKQMFDESFERRRNLLMKGFSNfNHEDFRAMHVNGVONDLV 
STFPNYLISILESKNWQLLLEI1GSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSK.KRJC 
. RTIETSITQNKSARKEVS\V^SISISRFSIFYRSSYKK^KQDLYF>ILHSICDRNTVKMWLOW 
LfNAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRJSLSKVYMHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRJFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWLVL 
GlCRSNAKJvlCLSDFEKJlKQIFAEFIYWLYNSFIIPlLQSFFYlTESSDLRNRTVYFRJCDI^ 
SMKMEAFEKINEhn^RJvtDTQKTTLPPAVIRLLPKKNTFRLITNLRXJlFLIKM 

LRPVASiLKJ^LrNEESSGIPFNLEVYMKLLTFKXDLLKJiRMFGRKKYFVRIDIKSCYDRIKQDLMFR 
IVKKJa.ICDPEFVfRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKLEHLSGHIVKIGNSQYLQKVGIPQCSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
5 LRVVDDFLFITVNKJODAKJO^LNLSLRGFEKJrNFSTSLEKTVINFENSNGfr^ 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFTsfSCCNI 
rP YRLGYSMCMRAQAYLKJmKDIFlPQRMFITDLLNVIGRKIWKKJLAEILGYTSRRFLSSAEVKWL 
y LGMRDGLKPSFKYHPCFEOLIYQFQSLTDLIKPLRPVLRQVLFLHRJUAD 

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ggtaccgatrtacmccmcncaiaagciaatigcttcctcgaacgctcctaaatctctggaaatamnacaagaactcaataacaataccaagtcaaanccaatatgaagg 

tgnanagigaicgataatamciaitmicggtcgnaccaaguiaaggacaaaaagaacaacnccnccccctaaagacrmacmattaamactmcaaau 

ggncgcnacrmaaicgiggiacigTTnagctgciacnctagccaaccgcgtgrnctaccccgicanggataiagctcnggagiagctcacagaaatccnacaaatcn 

ctgatgagactaLanagancartacagtccgigcatancnaacaiggagccrtacacmagatgagtcacgtcgcatgatggagiatnggiatcatccaacgmgccng 

aaaaggngaiaarumgcaaaatcaigtccnagtggiggiaatccgcgaaagTTtmgatgcngcacacgtctagcatgangagaiancaaaaamciaiccacu 

ctccmaacgcgg^TurTr^curmctattctca^g^gnccaaautgtatcatclcglaruggc^Tmccgrrmclccrggaatcgtacc^^ 

aataatctaaanagmcgcnataangatagtagtagaaaganggtganctactcgtgtaatgnanagmaaagatacmgcaaaacamanagctaicartauiaaaa 

aaaatcctataanataaaunaatcaatamgcggtcactatnamaaaacgnatgatcagtaggacacmgcaiatatatagnatgcTtaatggtucngiaacngcAT 

gaccgaacaccatacccccaaaagcaggattcttcgctttctagagaatcaatatgtatacctatgta 
ccttaaatgattatgtacaactt6ttttgagagggtcgccggcaagctcgtatagcaatatatgcgaa 
cgcttgagaagcgatgtacaaacgtccttttctal i i 1 1 cttcattcgactgtagtcggcttcgacagt 

AAGCCAGATGAAGGTGTTCAA I i I ILi ICTCCAAAATGCTCACAGTCAGAGgiaiatatatmtgrTngarmmciancg 
ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAG i 1 1 1 GAGCGTCG AAGGA 
ATCTACTGATGAAAGGG I I \ i CCATGgiaaggtanctaangtgaaatactiacctgcaanactgmcaaagagangtamaaccgataaagAA 

tcatgaagattttcgagccatgcatgtaaacggagtacaaaatgatctcgtttctacttttcctaatta 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACl 1 1 1 GTTAGAAATgtaaataccggnaagaigngcgcacmgaaca 
agactgacaagiatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTA i I 1 1 IGAGGCTCTTC 
CAAATGACAATTACCTTCAGATTTCTGGCATACCAC I 1 1 I i AAAAATAATGTGTTTGAGGAAACTGTGT 
CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCCCAAAGAAGTTTC 
CTGGAATAGCaTTTCAATTAGTAGGTTTAGCAI 1 1 i 1 1 ACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 
aactaaiacignaiccncaiaactaarmagATCTATA 1 I i i AACTTACACTCTATTTGTGATCGGAACACAGTACaCATC 
TGGCTTCAATGGA! i I I 1 CCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 
AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 
ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 
CACCCACG ATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCG i I I C i"I CGATC 
CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 
tangiataaaamanaccactaacgarmaccagACCTCGAAACi l I CTTGAAATTATCGAGATACGAGTC l l l lAGTTTAC 
ATTATTTAATGAGTAACATAAAGgtaautgccaaattTtmaccanaanaacaatcagATTTCAGAAATTGAATGGCTAG 
CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGA I I I IGAGAAACGCAAGCAAATATTTGCGG 
AATTCATCTACTGGCTATACAATTCGTTTATAATACCTAl I I 1 ACAATCi 1 1 1 I I 1 I ATATCACTGAATC 
AAGTGATTTACGAAATCGAACTGTTTA I 1 1 lAGAAAAGATATTTGGAAACTCTTGTGCCGACCCl 1 I AT 
TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtannaaagtatTrmgcaaaaagctaaiarrncagAACAA 
TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 
CTTTCGTCTCATTACGAATTTAAGAAAAAGA I lUl I AATAAAGgianaarmtggtcaicaaigiacmacnctaatciatiatiag 
cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAAC i I 1 ACGACCTGTGGCATCGATACTG 
AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 
TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaatiatataatgcgcganccicatianaatTngcagGCGTAAGAAG 
TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 
AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 
~^~AGCTACA-AAAA-AeTTTGTTAGTGAGGGGTTTTCCTAin'gtaagm 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGATT^ 
GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACAGATTCT 
TAAGgtataccaangngaangiaataacaciaatgaaactagATAGGAAA i I (J 1 CAATACCTTCAAAAAGTTGGTATCCCTC 
AGGGCTCAATTCTGTCATCl i 1 I I 1 GTGTCATTTCTATATGGAAGATTTGATTGATGAATACCTATCGTT 
TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 
AGCATGCAAAAAAA I I I i I GAATTTATCTTTAAGAGglgagngcigtcancctaagnctaaccgngaagGATTTGAGAA 
ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 
ATACI Mill i AATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAACATGAGGTCTCTTG 
ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTCACGAAAC 
ATATGGGGAAATCI 111111 1 ACAAAA I I C 1 AAGgtatacigtgtaactgaaiaatagcigacaaataatcagATCGAGCCTTGC 
ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 
CCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 
AAAGAATGTTCATAACGGgtgagtacnannaactagaaaagtcanaatiaaccttagATCi \ i i GAATGTTaTTGGAAGAAAA 
ATTTGGAAAAAGTTGGCCGAAATATTACGATATACGAGTAGGCCTTTCTTGTCCTCTGCAGAAGTCAA 
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ATGgiacgtgtcggtctcgagacttcagcaaiaugacacaicagGCTTTTTTGTCTTGGAATGAGAGATCCTTTGAAACCCTCTT 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTrrrGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtgtcamtcaamaaataiacatcctt 

tanactggtgtcnaaacaaiattanactaagiaiagctgacccccaaagcaagcatactataggatnctagtaaagtaaaanaatctcgnanagtmgangacngictn 

aiccnatactmaagaaagangacagtggngctgactactgcccacatgcccanaaacgggagtggnaaacanaaaagtaatacaigaggctaatctccmcatttag 

aataaggaaagtggnnctataaigaaiaatgcccgcactaatgcaaaaagacgaagartaicnctaaacaaggggganaagcatatccgaaggaaaagagagiaaiat 

acccagtgngngaagaaagcaaggataamggaacaagcnctgcagatgacaggctaaannggtgaccgaatmggiaaaagccccaggnaiccaiggiggccg 

gccngciacigagacgaaaagaaactaaggatagmgaatactaatagcicamaatgtcnatataaggmtgmtncctgacttcaacmgcatgggigaaaagaaaia 

gtgnaagccanangganccgaaatagccaaantcnggncctcaaagcggaagtctaaagaacitangaagcnaigaggcncaaaaactcctcctgamaaaggag 

gaatcttccaccgatgaggaaaiggatascnatcagctgctgaggagaagcctaatrmtgcaaaaaagaaaatatcangggagacaictcngaigaaicagaigcgga 

gagtatctccagcggatccngatgicaaiaacnciamctgaaatgtatggtccuctgtcgcttcgacnctcgiagctctacgcagitaagigaccaaaggtacc 
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EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FFYCTEISST VTIVYFRHDT WNKLITPFIV EYFKTYLVEN 

FFYVTEOQKS yjSKTYYYRKN IWDVI-MKMS IAD UOC ETLA--EVQE 

KKKE GSQIFWKKP IWKLVSKLTI VTCVRIQrSEK NKQMKNNFYQ 

FrY.TZ..:<. .S..YYVRK. IW,..-KL F..K V. . 



40 
44 

50 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



NVCRNHNSY- TUSNFNHi 

KEVEEWKXSL GFA 

KIQLEEENLE KVEEXLI PED SFQKY: 

;c r . F 



HIIP.KK5NNE 




FEIXMPCRG 
ESlHiMTF.RX 



79 
78 
92 

100 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



AI^EESTTiyX ENHKNAlQPTii: 

rv/NSDRKTTK LTT^^I^U^SjfHL^ 

DKQKNIK U^U^IU^S^QLVT- 

...... K K LN . N . . L . . S -^QL . L . 



QKILE^fiikNK FiPTSFTKIYS PTQIADRIKE 

HHFK -CPFGrAVFN 

viL-G -OXIGYSVTD 



. IG. -VF. 



129 
120 
130 

150 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 




NVL 

Z EFVCKWKQVG 
QFIEKWKI^KG 

. . . KWK . . G 



P = !: 



LiTTlKFD VKSCVD 
([jttePFATMD lEKCYD 
glTfVTL- 

31 

CifF.T.D . . .CYD 



157 
155 
158 

186 



.0 
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S-1; FFY VTE TTF QKN RLF FYR KSV WSK 

S-2; RQH LKR VQL RDV SEA EVR QHR EA 

S-3; ART FRR EKR AER LTS RVK ALF SVL NYE 



A-1; AKF LHW LMS VYV VEL LRS FFY VTE TTF Q 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV 




Figure 55 



Vector 



Genomic DNA Insert 



AS 



5825 bp Sequenced 
~2 kb Hind III Fragment 




Vector 



1 kb 



B 



RT Motifs 1 2 3(A) 4(B') 5(C) 6(D) 



introns 



2 5 4 



I <{:DNA 
3 ^ 5-2d 



CDNA 



i6 7 8 9 ion 



12 13 14 15 



Hind III Xca I 

Original PCR 
3' RT-PCR 



Xca I 



Hind III 




RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15 Be 
RT-PCR w/ M2-B15 Be 
RT-PCR w/ M2-B16 Bene 



500 bp 
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Polv 4 

t t c 

ta a g c c teg 
5'- cag acc aaa gga att cca taa gg 
QTKGIPQG- 



4(B') 



5 

D D Y L L I T 

3'- ccg ccg acg gag gag cag egg -5' 
a a aaaaa a a 
t t t c 
c c 
Poly 1 
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Motif B' (4) 
QTKGIPQG 



Motif C (5) 
DDYLLIT 
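
Primers designed against conserved peptide motifs such as these have to allow for codon redundancy. As a rough illustration, the sketch below counts how many distinct coding sequences could encode each motif under the standard genetic code. The function name `degeneracy` is hypothetical, and the count is an upper bound on the sequence space, not a description of how the Poly 4 and Poly 1 primers were actually designed.

```python
# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide: str) -> int:
    """Return the number of distinct nucleotide sequences that encode the peptide."""
    total = 1
    for residue in peptide:
        total *= CODON_COUNTS[residue]
    return total


print(degeneracy("QTKGIPQG"))  # Motif B' (4): 6144
print(degeneracy("DDYLLIT"))   # Motif C (5): 3456
```

Leucine, arginine, and serine contribute a factor of six each, which is why motifs rich in those residues are usually covered only in part by a practical degenerate primer.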



Figure 58 



PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 

Ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 

Ea_p123 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRVV 

Sc_p103 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS 



Qp K V G I P Q 

cg^ aaa get ggc acc ccc cag 

Poly 4 
m ^ t c 

a g c c ccg 

ca^ acc aaa gga acc cca caa 

ag acc aaa gga aci cca cca 

iq^ egg cc.c ccc caa ggc age 

K G ; P s 



GAA GAT TTG ATT GAT GAA TAC 
CTT CTA AAC TAA CTA CTT ATG 

E D L I D E Y 



G 

gg < Actual 

gg > 

ggC TCA ATT CTG TCA TCT 
ccG AGT TAA GAC AGT AGA 

G S I L S S 

CTA TCG TTT ACG AAA AAG 
GAT AGC AAA TGC TTT TTC 

L S F T K K 



Genomic Sequence. 



TTT TTG TGT CAT TTC TAT ATG 
AAA AAC ACA GTA AAG ATA TAC 

F L C H F Y M 

AAA GGA TCA GTG TTG TTA CGA 
TTT CCT AGT CAC AAC AAT GCT 

K G S V L L R 
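
Each genomic block in this panel is drawn as a coding strand over its base-paired partner, written left to right without reversal. A minimal sketch for checking such pairs follows; the helper names `complement` and `reverse_complement` are illustrative, and only A, C, G, and T are handled (an assumption; ambiguity codes pass through unchanged).

```python
# Translation table for Watson-Crick pairing; other characters (spaces, N) are left as-is.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(strand: str) -> str:
    """Return the base-paired strand in the same left-to-right order as the input."""
    return strand.translate(COMPLEMENT)


def reverse_complement(strand: str) -> str:
    """Return the partner strand written 5' to 3'."""
    return complement(strand)[::-1]


top = "TTT TTG TGT CAT TTC TAT ATG"   # codons for F L C H F Y M
print(complement(top))                # AAA AAC ACA GTA AAG ATA TAC
print(reverse_complement(top.replace(" ", "")))
```

Because spaces are preserved, the codon grouping of the upper line carries over to the lower line, which makes a side-by-side comparison with the drawing straightforward.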



— GTA-GTC— gac -ga c— c-a c— c-i^c—oCr^a cc— aec- 

CAT CAG ccg ccg acg gag gag cag egg 

^VDDYLLIT 



-- ccg ccg acg gag gag cag egg 

a a aaaaa a a 

c c c c 

c c 

Poly 1 



gac gac ecc ccc cec aca aca 
D 0 F L F I T 



< Actual Genomic Sequence. 
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3' RT PCR Strategy 



mRNA 

5' AAAAAAAAAAAAAAA 

1. Synthesis of cDNA with QT Primer. 

2. First Round PCR Using Outside Primer and Q0 Primer. 

3. Second Round PCR Using Inside Primer and QI Primer. 

4. Sequence Second Round PCR Products Using Inside Primer or QI Primer. 




Figure 60 



-Size Selected Libraries from P. Nurse 
3 - 4 kb 
5 - 6 kb 
7 - 8 kb 
11 - 12 kb 

-Libraries from J. A. Wise 
Sau 3A Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 

REP81ES Library (old) 

REP81ES Library (new) 

REP41ES Library 



B 

Original PCR 
3' RT-PCR 

500 bp 

Mg Conc. 

603- 

310- 
234- 
194- 

5' RT PCR Strategy 



mRNA 

5' AAAAAAAAAAAAAAA 

ddA- 

1. Synthesis of cDNA with Specific Downstream Primer. 

mRNA 

5' AAAAAAAAAAAAAAA 

2. Ligate Oligo with 5'-P and blocked 3' to cDNA using T4 RNA Ligase. 

3. First Round PCR 

4. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tez1p (429). WLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIW ...(35)... 

S.c. Est2p (366). WLFRQLIPKIIQTFFYCTEISSTVT-IVYFRHDTW ...(35)... 

E.a. p123 (441). WIFEDLVVSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 

Motif 1 Motif 2 K 

p hh h K hR h R 

S.p. Tez1p AVIRLLPKK--NTFRLITN-LRKRF ...(61)... 

S.c. Est2p SKMRIIPKKSNNEFRIIAIPCRGAD ...(62)... 

E.a. p123 GKLRLIPKK--TTFRPIMTFNKKIV ...(61)... 
* ♦ « * ♦ ♦ ♦ 

Motif 3(A) AF 

h hDh GY h 
S.p. Tez1p KKYFVRIDIKSCYDRIKQDLMFRIVK ...(89)... 
S.c. Est2p ELYFMKFDVKSCYDSIPRMECMRILK ...(75)... 
E.a. p123 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)... 

Motif 4 (B') 

hPQG pF hh h 
S.p. Tez1p YLQKVGIPQGSILSSFLCHFYMEDLIDEYLSF ...(6)... 
S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF ...(8)... 
E.a. p123 YKQTKGIPQGLCVSSILSSFYYATLEESSLGF ...(14)... 

Y Motif 5(C) Motif 6(D) 

r. r DDhhh Gh h cK r. 

S.p. Tez1p VLLRVVDDFLFITVNKKD-AKKFLNLSLRGFEKHNFSTSLEKTV (205) 

S.c. Est2p LILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKANRDKILAVSSQS (173) 
E.a. p123 LLMRLTDDYLLITTQENNAVLFIEKLINVSRENGFKFNMKKLQTSFPLS (209) 
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Figure 65 

Disruption strategy for the putative telomerase genes. 

1. Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker. 

2. Assay in selective media. 

3. Sporulate and grow on 
selective media. 

(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 

tez1 disruption causes progressive 
shortening of telomeres in S. pombe 
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Figure 68 



1 

met ser val tyr val val glu leu leu 

GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 

arg ser phe phe tyr val thr glu thr thr phe gln lys asn arg 

AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gln ser ile 

CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 

gly ile arg gln his leu lys arg val gln leu arg glu leu ser 

GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gln his arg glu ala arg pro ala leu leu 

GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 

thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 

ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 

ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys arg ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gln asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gln 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gln asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 




Figure 68 (cont.) 

190 200 

thr tyr cys val arg arg tyr ala val val gln lys ala ala met 

ACG TAC TGC GTG CGT CGG TAT GCC GTG GTG CAG AAG GCC GCG ATG 

210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gln cys 

GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 

gln gly ile pro gln gly ser ile leu ser thr leu leu cys ser 

CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 

CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG 

250 260 

arg asp gly leu leu leu arg leu val asp asp phe leu leu val 

CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 

ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280 290 

arg gly val pro glu tyr gly cys val val asn leu arg lys thr 

CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 

GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310 320 

phe val gln met pro ala his gly leu phe pro trp cys gly leu 

TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gln ser asp tyr ser ser 

CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 



340 350 

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 

TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 

TTC AAG GCT GGG AGG AAC ATG CGT CGG AAA CTC TTT GGG GTC TTG 

370 380 

arg leu lys cys his ser leu phe leu asp leu gln val asn ser 

CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 



Figure 68 (cont.) 

390 

leu gln thr val cys thr asn ile tyr lys ile leu leu leu gln 

CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gln leu pro phe his gln 

GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gln val trp lys asn pro his phe ser cys ala ser ser leu thr 

CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gln glu arg arg asp 

CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gln gly arg arg arg pro ser ala leu arg gly 

GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gln ala asp ser 

CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gln asp ser 

ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 

CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gln pro gly thr ala leu arg leu gln asp 

CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gln pro gly arg glu gln 

CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gln gly gly 

ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 

AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



CTGAGCGAGTGTGGAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 




Figure 68 (cont.) 



AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
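
The codon numbering in this figure runs from the first methionine to the stop marked OP (presumably the opal codon, TGA). A minimal sketch of how such an open reading frame could be located in a cleaned-up copy of the sequence is given below; the function names `first_orf` and `codon_count` are hypothetical, and the sketch simply takes the first ATG, which is an assumption rather than the method used to annotate the figure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_orf(seq: str):
    """Return (start, end) of the frame opened by the first ATG, end just past the stop codon.

    Returns None when there is no ATG or no in-frame stop codon.
    """
    seq = "".join(seq.split()).upper()
    start = seq.find("ATG")
    if start < 0:
        return None
    for i in range(start, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return start, i + 3
    return None


def codon_count(orf) -> int:
    """Number of sense codons in an ORF span, excluding the stop codon."""
    start, end = orf
    return (end - start) // 3 - 1


demo = "GCC ATG AGT GTG TGA GG"      # toy sequence, not taken from the figure
print(first_orf(demo))               # (3, 15)
print(codon_count(first_orf(demo)))  # 3
```

Applied to the opening of the figure, `"GCCAAGTTCCTGCACTGGCTGATGAGT".find("ATG")` gives 21, which matches the position where the drawing starts grouping the sequence into codons.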



0 
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Motif -1 

Ep p123 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tez1 ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

Hs TCP1 ...YVVELLRSFFYVTETTFQKNRL... 

consensus FFY TE 
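
The consensus row lists only the positions where all four sequences agree. A minimal sketch of that column-wise rule follows, using the four Motif -1 rows as printed above; the function name `strict_consensus` and the use of "." for non-conserved columns are illustrative choices, not part of the figure.

```python
def strict_consensus(rows, gap="."):
    """Return a string holding the residue where every row agrees, and `gap` elsewhere."""
    rows = list(rows)
    if len({len(r) for r in rows}) != 1:
        raise ValueError("aligned rows must all have the same length")
    return "".join(col[0] if len(set(col)) == 1 else gap for col in zip(*rows))


motif_minus_1 = {
    "Ep p123": "LVVSLIRCFFYVTEQQKSYSKT",
    "Sp Tez1": "FIIPILQSFFYITESSDLRNRT",
    "Sc Est2": "LIPKIIQTFFYCTEISSTVTIV",
    "Hs TCP1": "YVVELLRSFFYVTETTFQKNRL",
}
print(strict_consensus(motif_minus_1.values()))  # ........FFY.TE........
```

The lowercase letters used in the other consensus rows (h, p, c) appear to mark residue classes rather than identities, which this strict identity rule does not attempt to reproduce.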

K 

Motif 0 phhh K hR h R 

Ep p123 ...KSLGFAPGKLRLIPKKT-TFRPIMTFNKKIV... 

Sp Tez1 ...QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL... 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD... 

Hs TCP1 ...ARPALLTSRLRFIPKPD--GLRPIVNMDYVVG... 

consensus R PK R I 

AF 

Motif A h hDh GY h 

Ep p123 ...PKLFFATMDIEKCYDSVMREKLSTFLK... 

Sp Tez1 ...RKKYFVRIDIKSCYDRIKQDLMFRIVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCP1 ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//... 

consensus F D YD 

Motif B hPQG pS hh 

Ep p123 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tez1 ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD... 

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 

consensus G QG S 



Motif C h FDDhhh 

Ep p123 ...PNVNLLMRLTDDYLLITTQENN... 
Sp Tez1 ...KKGSVLLRVVDDFLFITVNKKD... 
Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 
Hs TCP1 ...RRDGLLLRLVDDFLLVTPHLTH... 
consensus DD L 


Motif D Gh h cK 

Ep p123 ...NVSRENGFKFNMKKL... 
Sp Tez1 ...LNLSLRGFEKHNFST... 
Sc Est2 ...KKLAMGGFQKYNAKA... 
Hs TCP1 ...LRTLVRGVPEYGCVV... 
consensus G 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 

251 CCCCCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 

301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 

351 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 

401 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 

451 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 

501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 

551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 

601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 

651 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 

701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 

751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 

801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 

851 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 

901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 

951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 

1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 

1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 

1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 

1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 

1201 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 

1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 

1301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 

1351 TGTGGCCGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 

1401 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 

1451 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 

1501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT 

1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 

1601 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 

1651 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 

1701 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 

1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 

1801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 

1851 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 

1901 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 

1951 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 

2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 

2051 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 

2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 

2151 GACCCCCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 

2201 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 

2251 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
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2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
2351 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 
2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
2551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 
2601 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
2651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
2701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
2801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
2851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
2901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 
2951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
3001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
3051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
3201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
3251 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
3351 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
3401 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 
3451 GGCCACCCCC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
3651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
3701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
3751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
3851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
3901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 



I 




Figure 72 



ajrw-fX*GAC0CAGGACGACGCCT3CACCCrrC^^^^^ 



A A r. R P A A H V C i: H C i-' C H H K U A 

K C V L L T W E A L A ? A T P A M F - 

f, A A c c A R K p w r F. p p r R c K - 

A 5v P I . P £ P. A L ? A A Q P r. P F. A 

=: A P K C K A V K L 1- K i: H r K r. V T, - 

A L P A A E P- C A P C C A A T T A R C C - 



w 
m 
m 

y 

M 
I- 

O 

ry 



121 -- - 

.\ ;■, G H V ?i A A r l" A ? O, T. A A G A A P 
P L A 7 V K K L G ? 0 C v; F L V O S - 

K Vi f- 'rt S C C A W C r R A 'J W C ^' A - 



G;i^C3r::^o;'xrrTTCCGCGCCNT\:;cTCGCcc^-^rrso^ 
^^cjT^ ^ t * * • ;-4n 

n - G G F P ?: ? 0 ;T F* ? " -3 V ?. A L G ? 
L' :^ A A ^' .K A ? V A T C ^ V C V T W * 7 - 
7 F; L 3 A R W w K T A >: C A C r ? " - 



2^1 * * ♦ 300 

T:JCC•^^'a;crx*'Ji;criccc;GCGGAc;aAA'3':;cQr^ 



A A r ?. ? L L ? ? C V L P £ V r. G 

/ ? P F A. A P r R 0 V S C L V > L V A 

r: V y V F p p p r A P c ? A * T w w :- 



^.01 * * :'^-0 



P A > ? A V n r F'. R E R A C L ?. L S 

p. V L 7 ? r . C ? R G A r M V L A ? 'J ? A - 
t C C ■/ : C A ■> A A R ? T C W ? A £ R - 

:^fi ^ . - 

::-tjA::av.':rr>rct::cG-G::-:occf:'t:crjf^jccrj:;r-ic 



A A G S G r R G F P R G L H H 0 F: n f. 

L C G A r< G G P P E A ? T T S '/ K V 
C w ':• G P A G A r ' r R P f P P A C A A 




Figure 72 (cont.) 

^■,2^ . * * 430 

P A C* H 0 D R n T ^ G E R v o A a A A 

L ? M T V T I> A t. R G £ C A W »U L L L K - 
t;: F' T P. ' F T H C C ''J A f; k 0 G f.* ■ C C A - 



P R C Ft R R A 0 P A G f L r< r ? f: A 

K V C D D V L V H L L A R C A ? F V L V 
AWATTCWFTCWHAAR ? T. cww 



■ — * 

t:crVviX?7n:GAcrcrxiAT::GTT;c^i.cp.ccccm:^^ 



; 0 L R L ? ? V rt A A v p a R P <" ^ 

? l: c A Y r V t:: r P L Y 0 I- ^" A A T - 

r A APT? >: A t'J K K C '1* i* i.' A L .* L - 



TCtj>5rCCCCKX:CCCUCCACACCCTAiTO>:ACCCC^^ 



0 ? A ? A T S '* " T P P ^' N ? 

A R F' P f- n A ? o p f ? V w n c T »; :, 
R ? G P K H T r. ? O r ? A i C I O A 



o^:'vA;'.c^^TA«:;cGT::.Afji>:;AGrx:o-:corr:c^^^ 
C(rTT:i7rAT'-v-'r>LTTrcirrrrL"x-v^ 



G T L A i* C R ? C li ? W A 'S r C ? £ 

h; K * 0 G G R C P P 0 L ? A P O A R ?: - 

NHS V s E A G V P L G C 0 P F- V G - 

121 * ^ '^^^ 

— - — cocy:!ii:ii^jc^ZfJvrj<cx'^^ 

A p. G O C Q P k i A V A y E; A O A w K G 
K f J G ^' A r R S L P L P K R F K G A A 

;V & V r A F. V G R C PR T G V* \ L - 

781 - - ^ 

p A G A U p. K. w A t3 V L G F r G O O .~ 

p C P E R T P V fi 0 0 1^ W A H P G ^ "1' r - 
' r. R K G R r r. g v; p g p t R A G F 1 - 





Figure 72 (cont.) 



TG<IV:TX;AoIX?Ac:X;GTtX»TTIt:TiJTGTGrjr^ 



w 



T E * P W r L C fi V T C 0 T P R R H 
; H S D F; li F C V V 5; r A R r A E E A T 
Cj P V T V V £ V w c H L P D P F K K F F - 



r 9 CO 



L F C C A L W H A F L F P T K r. r r A 

L E G A L S -J T K ri 5* H r £ V i;; R 0 n - 
LwkVRSLARATPTHFWAAST- 

tJCAO xrjiyv'xCvrccATcrACATO 

>r:i — 4 *- 1020 

r^TDSCCCCCryjC^UCCTAfJCTCTAOCrX^ 

P K C F ? I H r A A T T ? w D T r u 
H A 'j ? F '? T S P ? PR P G T P L V .P F. 
TPvArHrH?. '^HHVLCHALi; PC- 

Gr!^r^ACO^caIA.G/^ccAAGC^. ;^l ^■ ^t;:v:^ ^cTCC^^ z a^GGc^ 

1021 * * * ir.tsfu 

flAC/;T>'rGO'.rr-TrL>TrT\~LT7\';5A^";tTV';ATi^ 

V V A b. K K ? L <: D K ? T A / L 

f: T ? U T T •? T F 0 A T T r< . 

V K r. b O A L F L L L P. F 0 > H C ? ? - 

crrrC'r7AfnrAATATATCTt;At:;G'~ccHv';ccm';>.c^ 

1031 * ♦ n.irj 

L ? T Q V I • C i' A * L A C K F V t 'i* 

r L rJ • i h.- A Q F D W R £ C C S fc 

£ V I Y L R P L T C V R E V F G D V - 



rmrrTTCTr:-«yiTt:vA»x5<:cTr<30AT*3or^^ 
1141 -- ^ * i:oo 

? r L V r 0 L o c d ^: v o a f* f a f 
r: r v- q a i. o a p. r p r r t- r k r. r - 
I f. ^- ^ I* M p I- F r A .3 c r A r - 

t^^oi • • ♦ \?^r* 

A - A r: a a ? V r, w r p n /, v f- 

F V IV 0 R r L r L f: L I. ^:; N H A 0 * " • 



Figure 72 (cont.) 

'.7rA^:'>>.'vrn:rrTccT\:A.^CACCC^CT^cc<2CTCca^o^ 

• — * ► * ♦ ijjo 

u L R '~ V Q U A J-i K A A. IT C Cj H K 5 S R 

b Y G V F L K T H C F' L n A A V T F A A I'i - 

C T I"*. C S S R R T A R C E L R r Q Q r V - 

T»rnnr/ItXXXS3GGAGAA0C^ 
1321 ♦ 1360 

C L C A P r, I. C G P R ■ 0 T 0 

b V r A K b: K F O C S V A A e E £ E E H P - 

r ^: V P G R S P R A T. W R r r p n n N T D - 

ACorrv:cTiTxx:cTCC7ro^ 
i3&a - ^ 1440 

^ T r V A w c 5 c r A £: T A 5- r t*: T A 

i. r r £ P C A A A P r A O Q P A G V ? L - 

t: I' K k L V O L L R p H S 1- P w Q V Y O F - 

1441 - - — 1^'5^> 

:r C P A c A c w c P Q A s c a r C; t T 
^^ H A f; L f A F a c A H H r L t. q a c p - 

V K A C L R R L V P e c L w r; z R h n E - 

.wTJcrr;rri'ixx.^ti2j:;aAA/-:ACC^ 

isoi - »- * "* i^"*'^ 

TP?o>xi:;AAGt3AtrixxTiOT2<nTC^^ 

..J N A A r, T P R ^? r w 0 £ tt P 'J :: 

1, T r r. P Q E H Q E V U L P G E A C V A L 

r p R F L k N T K K r* I £ L i~ K* H A V. L 'S - 

ovrnTLa.GaA'3ciCA07r^iA«i'xa.Tciv-;iTtrp.?cc>^ 

LS^l * - - * ^^-^ 

— .xviAOTirfrioiALTGCAariTcrAcrc^ 



R C ?. 3 • K O R * A i? C; T A L C C A C. A 

h A A G A D V E D E K A G L R L A A O P 

L 0 C L T W K M i V K D C A w L R :i u - 

CA''i'-^;r-i-it;Gcit7TtnTC03QrcacAi';;/?^ 

*• ^ * • II'SO 

0 w A \' z K e o £ T V i": v p p i i-.' r r - 
L- f. G v; L ': < 0 R K A r A ' D P -J O v 

c -:; V i*: c V p A A E H ? T. P E E I L A - 



0 

b 




Figure 72 (cont.) 



[Sequence panel not legible in the source scan: nucleotide sequence with three-frame amino acid translation. Legible position labels: 1741, 2100.]



Figure 72 (cont.) 



[Sequence panel not legible in the source scan: nucleotide sequence with three-frame amino acid translation. Legible position label: 2700.]



Figure 72 (cont.)_ 



[Sequence panel not legible in the source scan: nucleotide sequence with three-frame amino acid translation. Legible position label: 3241.]






Figure 72 (cont.) 



[Sequence panel not legible in the source scan: nucleotide sequence with three-frame amino acid translation.]




Figure 72 (cont.) 



[Sequence panel not legible in the source scan: nucleotide sequence with three-frame amino acid translation, ending in a poly-T run. Legible position label: 4021.]


Figure 73 



DRA3 2 25 





Figure 74 



met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC



20 



30 



his tyr arg glu val leu pro leu ala thr phe val arg arg leu

CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG

40 

gly pro gln gly trp arg leu val gln arg gly asp pro ala ala

GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT



60

phe arg ala leu val ala gln cys leu val cys val pro trp asp
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC

70

ala arg pro pro pro ala ala pro ser phe arg gln val ser cys
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC



80



90 

leu lys glu leu val ala arg val leu gln arg leu cys glu arg
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



130

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG

140 150
trp gly leu leu leu arg arg val gly asp asp val leu val his

TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC

160

leu leu ala arg cys ala leu phe val leu val ala pro ser cys

CTG CTG GCA CGC TGC GCG CTG TTT GTG CTG GTG GCT CCC AGC TGC

170 180

ala tyr gln val cys gly pro pro leu tyr gln leu gly ala ala

GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC

190 

thr gln ala arg pro pro pro his ala ser gly pro arg arg arg

ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT

200 210 

leu gly cys glu arg ala trp asn his ser val arg glu ala gly

CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTG AGG GAG GCC GGG

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly

GTG CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 

ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 

AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gln gly ser trp

GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG

260 270 

ala his pro gly arg thr arg gly pro ser asp arg gly phe cys

GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT

280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu

GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG

290 300
gly ala leu ser gly thr arg his ser his pro ser val gly arg
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC



310 

gln his his ala gly pro pro ser thr ser arg pro pro arg pro

CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC

320 330 

trp asp thr pro cys pro pro val tyr ala glu thr lys his phe

TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC

340

leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu

CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA

350 360 

ieu ser sc*: T -y ^-g leu vai 

CTC AGC TCT CTG AGG ••^-•C GCT CGG AGG CTC GT 



-^G 



370

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro

GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 

arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro

CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC



400 

leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA

430 

ala ala gly val cys ala arg glu lys pro gln gly ser val ala
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG




Figure 74 (cont.). 



440 



450 



ala pro glu glu glu asp thr asp pro arg arg leu val gln leu
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGG CTG GTG CAG CTG

460 

leu arg gln his ser ser pro trp gln val tyr gly phe val arg
CTG CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG



470 



480 



ala cys leu arg arg leu val pro pro gly leu trp gly ser arg

GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser

CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC

500 510

leu gly lys his ala lys leu ser leu gln glu leu thr trp lys

CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val

ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT

530 

gly cys val pro ala ala glu his arg leu arg glu glu ile leu

GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG

550 

ala lys phe leu his trp leu met ser val tyr val val glu leu

GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTG GTG GAG CTG



560 570 

leu arg ser phe phe tyr val thr glu thr thr phe gln lys asn

CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC

580 

arg leu phe phe tyr arg pro ser val trp ser lys leu gln ser

AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC



590 



600

ile gly ile arg gln his leu lys arg val gln leu arg glu leu
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG

610

ser glu ala glu val arg gln his arg glu ala arg pro ala leu

TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG

620 

leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg

CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg

CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC



650 



660 



arg glu lys arg ala glu arg leu thr ser arg val lys ala leu
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA C



670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGG CTC CTG



1 



680 690

gly ala ser val leu gly leu asp asp ile his arg ala trp arg

GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gln asp pro pro pro glu leu

TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG



710 



720 



tyr phe val lys val asp val thr gly ala tyr asp thr ile pro
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC



730 

gln asp arg leu thr glu val ile ala ser ile ile lys pro gln
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG



740 



750 



asn thr tyr cys val arg arg tyr ala val val gln lys ala ala

AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC

760

his gly his val arg lys ala phe lys ser his val ser thr leu

CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG

770 780

thr asp leu gln pro tyr met arg gln phe val ala his leu gln
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG

790

glu thr ser pro leu arg asp ala val val ile glu gln ser ser
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC

800 810

ser leu asn glu ala ser ser gly leu phe asp val phe leu arg
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC

820

phe met cys his his ala val arg ile arg gly lys ser tyr val
TTC ATG TGC CAC CAC GCC GTG CGG ATC AGG GGC AAG TCC TAC GTC

830 840

gln cys gln gly ile pro gln gly ser ile leu ser thr leu leu
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC

850

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG

860 870

ile arg arg asp gly leu leu leu arg leu val asp asp phe leu
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG

880

leu val thr pro his leu thr his ala lys thr phe leu arg thr
TTG GTG ACA CCT ... CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC

890 900

leu val arg gly val pro glu tyr gly cys val val asn leu arg
CTG ... CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG
910 

lys thr val val asn phe pro val glu asp glu ala leu gly gly
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC



920 930

thr ala phe val gln met pro ala his gly leu phe pro trp cys
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC

940

gly leu leu leu asp thr arg thr leu glu val gln ser asp tyr
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC

950 960 
ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC

970 

arg gly phe lys ala gly arg asn met arg arg lys leu phe gly
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGG AAA CTG TTT GGG

980 990

val leu arg leu lys cys his ser leu phe leu asp leu gln val
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG

1000 

asn ser leu gln thr val cys thr asn ile tyr lys ile leu leu
AAC AGC CTG CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTG CTG

1010 1020

leu gln ala tyr arg phe his ala cys val leu gln leu pro phe
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT

1030 

his gln gln val trp lys asn pro thr phe phe leu arg val ile
CAT CAG CAA GTT TGG AAG AAC CCG ACA TTT TTC CTG CGC GTG ATC

1040 1050 
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCG AAG AAC



1060

ala gly met ser leu gly ala lys gly ala ala gly pro leu pro

GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC

1070 1080 

ser glu ala val gln trp leu cys his gln ala phe leu leu lys

TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTG AAG

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu

CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTG CTG GGG TCA CTC



1100



1110 



arg thr ala gln thr gln leu ser arg lys leu pro gly thr thr
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 



leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC



phe lys thr ile leu asp OP

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC

CAGCAC ACCTGCCGTCTT ^^Vi T _ :.CCACAGGC TC-GCGCTCC-GCTCCACCCCAGGC-CCAG 

CT7TTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGG.AATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGC-ACCCTGGGAC-CTCTGGGAATTTC-GAGTGACCAJ^-AGGTGTGCCCTGTA 

rACAGGCGAGGACGG:?GCACCTGGATGGC-GGT CCCTGTGC-GTC.AA.AT~GGGC-GGAGGTGC 

•rGTGGGAGTAJLA.ATACTGAATATATGAGTTTTTCAGTTTTGR.AAAJ\A-A^AAJiuAAA.AAAA 



1120 



1130 



1132